Participant
March 10, 2018
Answered

How can I detect the font size of a character in a PDF document using JavaScript?


I would like to automatically identify and edit the headlines of PDF text documents (newspaper articles). As the font size of a headline tends to be larger than that of the body text, I could identify the beginning and the end of a headline by scanning for font size changes between characters. Unfortunately, I have not yet found a JavaScript method for detecting font sizes, although this information is shown in the UI, even for proprietary fonts. Can anybody help?

try67
Community Expert
March 10, 2018

You can't edit static text using a script, though. Not directly, anyway.

Thom Parker
Community Expert, Correct answer
March 10, 2018

Use the "doc.getPageNthWordQuads()" function. It returns the quads for a word: 4 points, one for each corner of the bounding box of the word on the page. The height of this box is a close approximation of the font size.
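
For illustration, here is a minimal sketch of how the quads could be used to pick out headline candidates. The helper names wordHeight and findLargeWords and the minHeight threshold are hypothetical, not part of the Acrobat API. The sketch assumes unrotated text and that each quad is an array of eight numbers (x, y pairs for the four corners). It takes the height as the spread of the y values, so it does not depend on the order of the corners.

```javascript
// Height (in points) of a word's bounding box, computed from its quads.
// A word can span more than one quad; the tallest one is used.
function wordHeight(quads) {
    var maxHeight = 0;
    for (var i = 0; i < quads.length; i++) {
        var q = quads[i];
        // The y coordinates sit at the odd indices of the eight-number quad.
        var ys = [q[1], q[3], q[5], q[7]];
        var h = Math.max.apply(null, ys) - Math.min.apply(null, ys);
        if (h > maxHeight) {
            maxHeight = h;
        }
    }
    return maxHeight;
}

// Collect the words on one page whose box height is at least minHeight.
// doc is the Acrobat Doc object; pageNum is zero-based.
function findLargeWords(doc, pageNum, minHeight) {
    var result = [];
    var count = doc.getPageNumWords(pageNum);
    for (var i = 0; i < count; i++) {
        var h = wordHeight(doc.getPageNthWordQuads(pageNum, i));
        if (h >= minHeight) {
            result.push({
                index: i,
                word: doc.getPageNthWord(pageNum, i),
                height: h
            });
        }
    }
    return result;
}
```

From the Acrobat JavaScript console this could be called as findLargeWords(this, 0, 14) to list the words on the first page whose box is at least 14 points high. A suitable threshold depends on the document, so it is worth printing the heights of a few body-text words first.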

Thom Parker - Software Developer at PDFScripting
Use the Acrobat JavaScript Reference early and often
Participant
March 11, 2018

Many thanks for this advice. It works perfectly.