Horrendous, inaccurate OCR conversion
I took screenshots of a public domain book that was already digitized into fonts but which were not searchable. The result was better than any scan of paper would have produced. When I exported from PDF to Word I was horrified to see the OCR results which were probably worse than the software I used a decade ago - we're talking about maybe 90% accuracy.
I used Nuance AdvancedPDF on the same document - software which is from several years ago and no longer available so it's not a competitive product any more. The conversion was in the high 99% and near 100. The only thing it missed was consistent typeface so that pages were bolded text rather than plain text - which is fine and easily correct. Some pages had a slightly wrong font typeface. Also easily corrected. But the characters were near 100% conversion as they should have been.
So after spending all this money on the Adobe version because it's supposed to be the standard setter, why is the OCR conversion engine so horrible to the point of being unusable?
