Skip to main content
Participant
July 24, 2023
Question

Is there a way to extract the specific content related to a tagged PDF element?

  • July 24, 2023
  • 1 reply
  • 922 views

There are many developer SDKs that can extract the tag tree of a tagged PDF (a PDF with accessibility markup), and there are many that can extract the PDF content as one big text string.

 

However I need to find the specific content which each tag points to in the PDF. I know Acrobat has this data because you can clearly see it in the tag tree in Acrobat (picture attached), but is this programmatically possible through an API?

 

thank you

This topic has been closed for replies.

1 reply

Bernd Alheit
Community Expert
Community Expert
July 24, 2023

It is possible when you create a plugin (written in C/C++) for Adobe Acrobat.

Participant
July 24, 2023

Can you explain a bit? Is there a code example or API for this? Why only in a plugin?

Thom Parker
Community Expert
Community Expert
July 24, 2023
quote

Can you explain a bit? Is there a code example or API for this? Why only in a plugin?


By @Iddo31257976zlkh


Haha haha, Ask Adobe why. We just use what they give us. 

 

Download the Acrobat SDK and look at the provided examples. Read the documentation. 

https://opensource.adobe.com/dc-acrobat-sdk-docs/library/plugin/index.html

 

 

Thom Parker - Software Developer at PDFScriptingUse the Acrobat JavaScript Reference early and often