Skip to main content
Participant
September 30, 2016
Question

Extracting Email Information from PDF

  • September 30, 2016
  • 1 reply
  • 538 views

Hello,

I have a several thousand page PDF of emails that have been printed and aggregated from Outlook.

The PDF is searchable.

I would like to extract the From, To, Sent, and Subject information to an Excel file.

The final output would be a table in excel that collects the From, To, Sent and Subject data from every occurrence of such in the PDF (with maybe a page reference as well, but that's less important).

Can this be done? Something relating to forms? If so, can someone point me in the right direction to learn this functionality?

Thanks,

Casey

This topic has been closed for replies.

1 reply

try67
Community Expert
Community Expert
September 30, 2016

Possibly, but it will require a complex, custom-made script to be developed.

It's impossible to say for sure if it can be done or not without seeing some sample files.

If you're interested in hiring someone to develop this script for you feel free to contact me at try6767 at gmail.com.