Skip to main content
May 19, 2017
Question

Reading contents of a PDF into EXCEL

  • May 19, 2017
  • 2 replies
  • 434 views

Is there any method through which I can read the contents of PDF into Excel sheet without using Sendkeys? (Code preferred in Python, VBA, Javascript only)

This topic has been closed for replies.

2 replies

Legend
May 22, 2017

Yes, using the VB bridge to JavaScript and the getPageNthWord method. This is described in the Acrobat SDK. Acrobat (not Reader) required As well as study of the SDK.

Legend
May 19, 2017

It depends what you mean by "contents".

* The raw collection of words in a PDF is not too difficult from VB/VBA

* Rows and columns are not even a concept in PDF, so this requires guesswork from positions

* Graphics is not possible through scripting

May 22, 2017

By contents, I mean some text and number. That's all.

I do have the code which runs perfectly in VBA. However, it uses sendkeys. Is there any method through which I can avoid the usage of sendkeys?