Character Encoding Error Using Acrobat Pro 2020 OCR
Acrobat Pro 2020.005.30467 (had an update this week)
Windows 10 Pro 19045.2728
Experience: Just a user; no programming.
I finalize reports that need to pass the basic 508 requirement (not for the web).
With these reports, I usually have to insert a scanned signature page (scanned by various people using various scanning equipment), and then OCR it so that I can apply tags. The signature page is prepared with a wet (not digital) signature. Most of the time after applying OCR, the signatures get tagged as figures with a description that they are approvals/signatures. No problem. No issues.
Some reports are required to have a Quality Control (QC) statement. These are usually sent as a digital Adobe Signed document that are made accessible after applying OCR.
Included in the reports are appendices of various documents (shipping docs, invoices, data sheets, other reports, etc.) that have to be scanned, OCRd, and made accessible. Sometimes these documents are sent electronically as pictures but then have to be OCRd to make accessible. All important information is tagged as text and not figures.
For some reason, Acrobat is frequently creating character encoding errors after the OCR or applying tags to the digitally signed QC statements. The majority of character encoding errors are with the signatures of the scanned pages.
In the past this happened occasionally with math equations or scientific formulas created in MS Word (Word to PDF), but now it's happening a lot! with signatures and other items.
I've tried all sorts of things: saving as a picture then back to a PDF to tag it; save as a PDF; save to a PDF using the PDF printer driver; popping it into Adobe Illustrator and back to a PDF. I tried using the preflight tool. I looked at fonts installed on my system. I can't even remember all of the things I have tried.
(I do not have Adobe PhotoShop.)
In many cases, the source document is the document being used and has to be OCRd.
The important question is Why all of the character encoding errors now when using Acrobat's OCR feature?
Thank you for any help or direction.
