Skip to main content
Known Participant
September 20, 2022
Question

Extract Selected Pages - Automation

  • September 20, 2022
  • 1 reply
  • 850 views

Hey everyone, 

 

here's what I'm trying to do:


I have a 400 pages PDF from which I want to extract approximately 100 pages containing a specific word in it and I want to delete all of the others.

Is there a way to do that automatically without manually selecting all of the pages I want to get rid of?

This topic has been closed for replies.

1 reply

try67
Community Expert
Community Expert
September 20, 2022

Yes, this can be done using a script. There are various approaches to do it. For such a large file I would not recommend trying to search it using the script, as it's very likely to halt mid-process. I would use the built-in Advanced Search command to locate the instances where this word appears and then use that information to export those pages. Another option is to use the built-in Search & Remove Text command to achieve something similar, as it highlights all matches for redaction, which can then be used to identify the pages they're on (without actually redacting them, of course).

 

I've actually developed the former script as a paid-for tool, if you're interested. You can find it here:

https://www.try67.com/tool/acrobat-print-or-extract-pages-from-csv-search-results

 

Known Participant
September 20, 2022

@try67 , thank you very much for your support.


I've used the Advanced Search method and located the istances, but it's still not clear how to get a list of the pages in which the word appears to be in. In the whole document there are 380 istances and I'm assuming they can be found in approximately 90-100 pages.

How can I get a list of these pages?

try67
Community Expert
Community Expert
September 20, 2022

You can export the results as a CSV file, and then process that file to get a list of the page numbers with matches. Notice that there can be duplicates in it.