Skip to main content
Participant
September 23, 2016
Answered

Automating multiple page scanned PDF's to export to Excel

  • September 23, 2016
  • 1 reply
  • 887 views

We have a LOT of scanned PDF's at work (about 8,400) that we are needing to export into Excel. There are a few tricky parts to this as I'm trying to find if there's a way to automate all of this. These are all individual PDF's, but I know I could convert them all to a multipage PDF if I need to, that shouldn't be an issue. The major issue being we only want a small portion of all of the PDF's, the top left corner that has some basic information (name and address), to be exported to an Excel spreadsheet. The rest of the PDF we do not need. The are all single page PDF's if that helps anything on the automation formatting. Does anyone know if that can be done?

I asked this in the PDF forum but someone suggested to ask it in the java script forum instead.

This topic has been closed for replies.
Correct answer try67

It can be done, most likely, but it would involve a custom-made script to extract the text from the desired coordinates and then export it to a format that can be read using Excel (a tab-delimited text file, for example). This is not a simple task, though.

1 reply

try67
Community Expert
try67Community ExpertCorrect answer
Community Expert
September 23, 2016

It can be done, most likely, but it would involve a custom-made script to extract the text from the desired coordinates and then export it to a format that can be read using Excel (a tab-delimited text file, for example). This is not a simple task, though.

Participant
September 23, 2016

Thank you SO MUCH! I was wondering if it even could be done and sounds like it possibly can. So I guess now it's just finding someone who can write JavaScript?

try67
Community Expert
Community Expert
September 23, 2016

Correct. I'm such someone, by the way... If you want I'll be happy to check out some sample files and let you know if I think it can be done, and if so for how much. You can contact me directly at try6767 at gmail.com to discuss it further.