I have a 500-page medical record that seems to have a lot of duplicate pages. Is there a program that would scan through the document and identify the identical pages? I don't need to remove them just ID them. I cannot believe that there isn't something readily available.
Hope you are doing well, and sorry for the trouble. As I understand it, you want to find the duplicate pages in the PDF file.
In Acrobat, there is no option to recognize the duplicate pages automatically. Please check out the correct answer marked in a similar discussion https://community.adobe.com/t5/adobe-acrobat-online/adobe-acrobat-pro-dc-how-to-delete-duplicate-pag... and see if that helps.
That would be a really useful feature!
Thanks for the information, but I'd rather not.
This is an extremely complicated task. One possible way would be to export all the pages as images and then use an application that can compare those. Another would be to use a plugin or a script to compare the textual contents of the pages, although doing that with 500 pages that all have to be compared to one another will probably be too much for Acrobat to handle. A stand-alone tool is more suited for that task.
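For what it's worth, the text-comparison route does not have to compare every page against every other page. Here is a minimal stand-alone sketch in Python that hashes each page's text and groups pages with the same hash. The function names are made up for illustration, and the text extraction step assumes the third-party pypdf library is installed; any other extractor would work just as well. It only helps if the PDF actually contains text, so a scanned record would probably need OCR first.

```python
import hashlib
import re
from collections import defaultdict


def normalize(text):
    """Collapse whitespace and case so trivial extraction differences are ignored."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_duplicate_pages(page_texts):
    """Return lists of 1-based page numbers whose normalized text is identical."""
    groups = defaultdict(list)
    for number, text in enumerate(page_texts, start=1):
        cleaned = normalize(text)
        if not cleaned:
            # Pages with no extractable text (e.g. scans) are skipped,
            # otherwise they would all be reported as one big duplicate group.
            continue
        digest = hashlib.sha256(cleaned.encode("utf-8")).hexdigest()
        groups[digest].append(number)
    return [pages for pages in groups.values() if len(pages) > 1]


def extract_page_texts(pdf_path):
    """Assumption: pypdf is installed. Returns one string per page."""
    from pypdf import PdfReader  # imported lazily so the rest runs without it

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]
```

Calling `find_duplicate_pages(extract_page_texts("record.pdf"))` (the file name is a placeholder) would return groups of page numbers and leave the document itself untouched, which matches the "just ID them" requirement. Because each page is hashed once, 500 pages is a small job.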
It's not necessarily extremely difficult. In this case a general-purpose tool for comparing pages isn't needed. It's a limited context, so there must be a simpler identifying feature, such as text on a small section of the page, or even something as simple as the bounding box.
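As a rough illustration of the "small section of the page" idea, the Python sketch below keys each page on its first few non-empty lines of text only and reports pages that share a key as candidates for a manual check. Everything here is an assumption for illustration: the function names are hypothetical, the choice of the first three lines is arbitrary, and the page texts are assumed to have been extracted already by some other tool.

```python
from collections import defaultdict


def section_key(text, max_lines=3):
    """Hypothetical cheap key: the first few non-empty lines of a page."""
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    return "\n".join(lines[:max_lines])


def candidate_duplicates(page_texts, max_lines=3):
    """Group 1-based page numbers that share the same small text section.

    These are only candidates: two different pages can share a header,
    so each group still needs a quick look by a person.
    """
    buckets = defaultdict(list)
    for number, text in enumerate(page_texts, start=1):
        key = section_key(text, max_lines)
        if key:  # ignore pages with no text at all
            buckets[key].append(number)
    return [pages for pages in buckets.values() if len(pages) > 1]
```

The right section depends on the record. If every page starts with the same letterhead, a larger `max_lines` or a different slice of the page would be needed before the key says anything useful.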