Copy link to clipboard
Copied
Dear all
is it possible we can extract all text from a page of PDF with text location or coordinates by writing a script. Thanks
Copy link to clipboard
Copied
Yes. Look up getPageNthWord and getPageNthWordQuads.
Copy link to clipboard
Copied
Please check out this link: http://help.adobe.com/livedocs/acrobat_sdk/9.1/Acrobat9_1_HTMLHelp/wwhelp/wwhimpl/common/html/wwhelp...
getPageNthWordQuads will only give quad property of the Annotation object which can be used for constructing text markup, underline, strikeout, highlight and squiggly annotations. How will this return the coordinates of the word?
Copy link to clipboard
Copied
getPageNthWordQuads returns the coordinates of the word.
Copy link to clipboard
Copied
To create annotations, markup etc the coordinates are needed. Examine what a "quad" type contains and bear in mind that text, which does not exist at a single point, does not have a single coordinate; rather it occupies space. The baseline is not provided.
Find more inspiration, events, and resources on the new Adobe Community
Explore Now