Skip to main content
Galeodan
Known Participant
June 10, 2026
Question

How do I listt all media used on a website so I can purge redundant files from the server?

  • June 10, 2026
  • 2 replies
  • 35 views

I have been building and maintaining a couple of websites for my family business. Over the years, and after many updates, there are a lot of redundant media files on the servers (local and public).  Is there an app that will analyse the website and list the media used? Then I could purge the server media and upload only those needed.

I tried this years ago using Dreamweaver to “put” files on the server, but it was not reliable. For example: it would miss alternative images in <source> tags.

    2 replies

    Nancy OShea
    Community Expert
    Community Expert
    June 11, 2026
    Try Xenu Link Sleuth by Snafu. It can crawl your live site and report all links & media files referenced in your pages. 

     

    Make reliable backups of your site before deleting files from your server. 

     

    Good luck.

     

    Nancy O'Shea— Product User & Community Expert
    Galeodan
    GaleodanAuthor
    Known Participant
    June 11, 2026

    Thanks Nancy - I’ll check it out

    Community Expert
    June 10, 2026

    Hi ​@Galeodan ,

    Some files may be referenced directly in HTML, others through CSS or JavaScript, or sometimes even from a database. Depending on how the site is built, a tool may find some of those references and completely miss others.
    Your remark about <source> tags is actually a good example of that 🙂


    Just wondering, what kind of website are we talking about? A mostly static site, a CMS, or something more dynamic? The reason I'm asking is that "unused media" can be surprisingly difficult to determine with certainty.
    Roughly how large is the site? A few dozen pages, a few hundred, or more?

    Lena

    Galeodan
    GaleodanAuthor
    Known Participant
    June 10, 2026

    Hi Lena. Thanks for the quick response. I have two main websites, and each of these has about 40 pages. They are very basic - Authored from scratch, using only HTML, CSS, and a smattering of JS. I find little time to work on them, but I have enjoyed the process of creating them while learning the basics of coding (had some free time during Covid). 

    The pages are sorely in need of improvement with regards to SEO etc., but my immediate concern is to tidy up the mass of images that I have, locally and online.

    I have been looking at Screaming Frog, to see if that could help, although much of it’s capabilities are beyond my comprehension. 

    Sean

    Community Expert
    June 11, 2026

      

    That actually helps clarify things 🙂

    With a relatively small static site built mostly with HTML, CSS and basic JS, the problem becomes much more manageable than I initially imagined.

    In that context, as proposed by ​@Nancy OShea tools like Xenu or Screaming Frog may indeed get you surprisingly close to what you're looking for, since most of the references are likely to be directly visible in the pages themselves.

    The challenge then becomes less "finding what is used" and more "identifying what is not used". Once you have a list of the files actually referenced by the site, the task is simply to compare that list with the files present locally. A file comparison tool such as UltraCompare could do that very efficiently: the referenced files are kept, while the unreferenced ones are identified and removed from the local folders.

    One important precaution, however: before deleting anything locally or synchronising the local copy with the server, make absolutely sure that your local site is fully up to date with the live version. The online site is the version currently doing the job, so if there is any doubt, download and synchronise from the server first. Otherwise, you risk removing files that are still being used on the live site but are missing from an outdated local copy.

    Once you are confident that the local and live versions match, you can proceed with the cleanup. After that, synchronising the site, using DW, would remove from the server any files that no longer exist locally (although empty folders would likely need to be deleted manually, as Dreamweaver does not usually remove them automatically).