Skip to main content
New Participant
October 4, 2017
Answered

PDF to EXCEL export is not accurate

  • October 4, 2017
  • 1 reply
  • 1908 views

We use the "Adobe Export PDF-App" to export scanned PDF-Files to Excel-Files.

All the original pdf pages are built very similiary. But after the conversion to an excel file sometimes the last columns (of the original document) are in the next rows (in the excel sheet).

Also the conversion sometimes creates seperate column and sometimes just puts all words/ numbers in one row and adds blanks between.

The first problem is more dramatically than the second.
How can we solve that? (we alreade use the highets available scan qualitity)

This topic has been closed for replies.
Correct answer try67

Converting one file format to another is always tricky. In this case you're doing two conversions: From a scanned image to "real" text (ie, OCR), and then from PDF to Excel. It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it.

1 reply

try67
Community Expert
try67Correct answer
Community Expert
October 4, 2017

Converting one file format to another is always tricky. In this case you're doing two conversions: From a scanned image to "real" text (ie, OCR), and then from PDF to Excel. It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it.

New Participant
February 25, 2021

This means the use of PDF is impractical.  In our case, we have PDF invoices printed from an e-Commerce site and emailed. Then Power Automate needs to convert the invoice back to actionable data fields.  "It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it." means that using the Adobe PDF mechanism to tranfer data cannot lead to a solution.