Skip to main content
mathiasc5545865
Participant
July 18, 2019
Question

Extracting data from multiple PDF files.

  • July 18, 2019
  • 1 reply
  • 2192 views

I have a large number of PDF certificates in individual PDF's, from which I would like to extract some text out of and into a CSV / Excel file (multiple or one). It seems to be easy enough if the data was from forms, but its just plain text (not scanned) in the files. Does anyone have any idea to do this, if it's possible?

It's the data in the red frame I want to extract:

This topic has been closed for replies.

1 reply

try67
Community Expert
Community Expert
July 18, 2019

This is often possible using a custom-made script, but it's not possible to say for sure without seeing the actual file.

If you're interested in hiring someone to do it for you feel free to send me such a file to [try6767 at gmail.com] and I'll let you know if I think it's possible, and if so for how much. I've developed many similar tools in the past for my clients.