Problem converting PDF to TEXT - only on some pages
Hi,
We download PDF files from Walmart with our POs. These files are between 10 and 50 pages long.
We export each PDF to TXT format and then import the TXT into our accounting program.
Lately, some of the pages (POs) have their data scrambled. It might happen on 1-3 pages of a 40-page PDF.
It always happens in the section with the line items (the most important part). Instead of one short line per item, the export produces one super long line that mixes up the column headers with the data from the next line.
It isn't consistent and I can't seem to find out why this happens.
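Since the scrambled sections show up as one very long line, one way to locate them in an exported TXT file is to flag lines that are far longer than the typical line. Here is a minimal Python sketch of that idea. The function name find_suspect_lines and the factor and min_length thresholds are illustrative assumptions and would need tuning against a real export; this only finds the bad lines, it does not repair them.

```python
import statistics


def find_suspect_lines(lines, factor=3.0, min_length=80):
    """Return (line_number, length) for lines much longer than the typical line.

    `factor` and `min_length` are hypothetical thresholds chosen for
    illustration, not values taken from any Acrobat documentation.
    """
    lengths = [len(line.rstrip("\r\n")) for line in lines]
    non_empty = [n for n in lengths if n > 0]
    if not non_empty:
        return []
    # The median is not skewed by a handful of very long lines.
    typical = statistics.median(non_empty)
    threshold = max(min_length, factor * typical)
    return [(i, n) for i, n in enumerate(lengths, start=1) if n > threshold]
```

To use it, read the exported file with open(path, encoding="utf-8", errors="replace"), pass the file's lines to find_suspect_lines, and check the reported line numbers against the PDF pages.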
I tried this with Acrobat Pro 2019 DC, 2020 DC and even the latest 2021 DC. I even tried the non-DC 2020 version just to see what happens, and the same scrambling of SOME sections on a few pages happens, always in the SAME place in the TXT file.
Strangely, I can usually use a workaround:
- open the PDF
- export to EXCEL format (option single worksheet)
- in Excel, SAVE AS ADOBE PDF (option entire workbook, fit to width)
- open the new PDF and export it as TXT, which usually works properly
I tried online converters but they are all terrible. What is the best way to convert a PDF to TXT format?
Thanks
Richard
P.S. I can email a sample PDF to anyone who wants to reproduce this problem.
