Copy link to clipboard
Copied
We use the "Adobe Export PDF-App" to export scanned PDF-Files to Excel-Files.
All the original pdf pages are built very similiary. But after the conversion to an excel file sometimes the last columns (of the original document) are in the next rows (in the excel sheet).
Also the conversion sometimes creates seperate column and sometimes just puts all words/ numbers in one row and adds blanks between.
The first problem is more dramatically than the second.
How can we solve that? (we alreade use the highets available scan qualitity)
Converting one file format to another is always tricky. In this case you're doing two conversions: From a scanned image to "real" text (ie, OCR), and then from PDF to Excel. It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it.
Copy link to clipboard
Copied
Converting one file format to another is always tricky. In this case you're doing two conversions: From a scanned image to "real" text (ie, OCR), and then from PDF to Excel. It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it.
Copy link to clipboard
Copied
This means the use of PDF is impractical. In our case, we have PDF invoices printed from an e-Commerce site and emailed. Then Power Automate needs to convert the invoice back to actionable data fields. "It's very likely that the final results will differ somewhat from the original. There's nothing that can be done about it." means that using the Adobe PDF mechanism to tranfer data cannot lead to a solution.