Skip to main content
Participant
July 24, 2023
Question

Is there a way to extract the specific content related to a tagged PDF element?

  • July 24, 2023
  • 1 reply
  • 917 views

There are many developer SDKs that can extract the tag tree of a tagged PDF (a PDF with accessibility markup), and there are many that can extract the PDF content as one big text string.

 

However I need to find the specific content which each tag points to in the PDF. I know Acrobat has this data because you can clearly see it in the tag tree in Acrobat (picture attached), but is this programmatically possible through an API?

 

thank you

This topic has been closed for replies.

1 reply

Bernd Alheit
Community Expert
Community Expert
July 24, 2023

It is possible when you create a plugin (written in C/C++) for Adobe Acrobat.

Participant
July 24, 2023

Can you explain a bit? Is there a code example or API for this? Why only in a plugin?

Bernd Alheit
Community Expert
Community Expert
July 24, 2023

"Why only in a plugin?"

It is not possible with IAC, OLE, or Javascript.