Participant
June 9, 2021
Question

Export to Excel is a mess


I'm trying to convert the following kind of text arrangement in a PDF into an Excel file:

 

When I try to convert I get the following: 

 

However, when I convert text from a PDF that has lines around the text in a table fashion, then it converts perfectly. 

 

There must be a way I can convert this lineless text like in the first picture. I can't keep typing this out manually 😞


3 replies

Thom Parker
Community Expert
June 9, 2021

If you want to parse a table, get a third-party table-parsing tool, such as Tabula. Parsing tables is non-trivial, and you will find that very few tools do it well, if at all.

 

 

Thom Parker - Software Developer at PDFScripting
Use the Acrobat JavaScript Reference early and often
J E L
Community Expert
June 9, 2021

Have you tried exporting the PDF as a Word document, then seeing if Word can automatically convert the text to a table, and then exporting the Word table to Excel? Just a thought...

Community Expert
June 9, 2021

I would save as text from Acrobat, open the file in a text editor, and clean it up: remove extraneous junk, replace the spaces between columns with tabs, and so on. Then open the text file in Excel for a more workable spreadsheet.
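If the cleanup has to be repeated for many files, the space-to-tab step could be scripted rather than done by hand. The following is only a rough sketch, assuming the text saved from Acrobat separates columns with runs of two or more spaces while words inside a cell are separated by a single space. The function name `columns_to_tsv`, the `min_gap` parameter, and the sample data are made up for illustration and are not from the original post.

```python
import re


def columns_to_tsv(text: str, min_gap: int = 2) -> str:
    """Turn space-aligned columns into tab-separated lines.

    Assumption: a run of `min_gap` or more spaces marks a column break,
    while a single space is part of the cell's own text.
    """
    gap = re.compile(r" {%d,}" % min_gap)
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # drop blank lines left over from the export
        rows.append(gap.sub("\t", line))
    return "\n".join(rows)


# Hypothetical sample standing in for text saved from the PDF.
sample = (
    "Item        Qty    Price\n"
    "Blue widget   4     9.99\n"
    "\n"
    "Red widget    10    12.50\n"
)

tsv = columns_to_tsv(sample)
print(tsv)
```

Writing the result to a `.txt` or `.tsv` file and opening it in Excel should place each tab-separated value in its own column. Note that this approach cannot detect empty cells: a row with a missing value will shift left, so the output still needs a quick visual check.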