October 10, 2024
Question

Extract content based on topic


I manage multiple contracts that are all PDFs. Although the language in each contract is unique, the topics covered in each contract are the same. For instance, each contract has an Article that specifies the term of the agreement.

What I would like to do is extract content from each document by topic. Using the example above, I would like to export every Article that relates to the term from several different documents. I've tried exploring bookmarks, tags (I was thinking I could tag the content like HTML, but I couldn't figure out how to wrap content in a custom tag), search indexing options, and even some rudimentary JavaScript. It seems like there should be a way to tag content in documents and then search or extract by tag. Any help or ideas are greatly appreciated. Thank you!


1 reply

try67
Community Expert
October 10, 2024

Tagging can be done in multiple ways: you can add the tag as a metadata value on the file, or as a bookmark, comment, form field, etc. I think bookmarks are probably the best way of doing it.

Once tagged, the files can be searched using a script which will identify these tags and extract the sections you're interested in to a new file. This will require the development of a custom-made script, though. It's not a built-in feature of Acrobat.
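
To give a rough idea of the matching step such a script would need, here is a minimal sketch in plain JavaScript. It is not an Acrobat feature and not a finished tool: findTopicRanges and the { title, page } bookmark list are hypothetical names made up for this illustration, and the sketch assumes each Article starts on a new page and that the bookmarks are listed in document order. In Acrobat's JavaScript API the bookmark tree is reachable from the document's bookmarkRoot property and pages can be copied out with extractPages, but reading each bookmark's target page and looping over a folder of files are left out here.

```javascript
// Hypothetical helper (not part of the Acrobat API).
// bookmarks: array of { title, page } in document order, page = 0-based start page.
// topic:     text to look for in the bookmark title (case-insensitive).
// pageCount: total number of pages in the document.
// Returns 0-based, inclusive page ranges for every matching bookmark.
function findTopicRanges(bookmarks, topic, pageCount) {
  const needle = topic.toLowerCase();
  const ranges = [];
  for (let i = 0; i < bookmarks.length; i++) {
    const bm = bookmarks[i];
    if (!bm.title.toLowerCase().includes(needle)) continue;

    // Assumption: a section ends on the page before the next bookmark starts,
    // or on the last page of the document if there is no next bookmark.
    const next = bookmarks[i + 1];
    let end = next ? next.page - 1 : pageCount - 1;

    // If the next bookmark starts on the same page, keep at least that page.
    if (end < bm.page) end = bm.page;

    ranges.push({ title: bm.title, start: bm.page, end: end });
  }
  return ranges;
}

// Example with made-up bookmark data for a 10-page contract.
const sampleBookmarks = [
  { title: "Article 1 - Definitions", page: 0 },
  { title: "Article 2 - Term of Agreement", page: 2 },
  { title: "Article 3 - Payment", page: 4 },
];
const termRanges = findTopicRanges(sampleBookmarks, "term", 10);
console.log(termRanges);
```

Each range returned could then be used as the start and end page when extracting that section to a new file. Note that a plain substring match is crude: searching for "term" would also match a bookmark titled "Termination", so consistent bookmark names across the contracts matter.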

If you're interested in hiring a professional to create it for you, feel free to contact me privately by clicking my username and then the blue "Message" button.