Improve bad OCR quality
For years now I have downloaded U.S. patents in PDF from Google Patents and other sources. Adobe's OCR engine routinely fails to recognize "fi" (recognizing it as "?"), misreads a lower case w as an upper case W, and sometimes misreads a lower case z as an upper case Z. I have seen this behavior consistently across hundreds of patent docuemtsn.
The only remedy I have found in this forum is to change each occurrence manually. That's not an acceptable solution. At a minimum, there should be a global search & replace. Is there?
Thanks,
- bill.
