OCR skips some text

Question

I have a large PDF of an old book that I'm trying to convert to text, ultimately to create an ebook. The print-ready PDF was supplied by the printer, but we don't have access to the original layout files (InDesign, or whatever software was used) from some 20-odd years ago. The PDF is essentially just page images, so the text needs to be freshly OCRed, and I'm trialling the latest Adobe Acrobat DC for this purpose.

OCR seems to work quite well on the text that Acrobat recognises, but it is skipping large slabs of text. The image below shows what I mean.

Is there a way I can force Acrobat to OCR regions not automatically identified as text?

Lovekesh Garg · Answer

Please try a different OCR option. Go to Tools > Enhance Scans > Recognize Text > In this file > Recognize Text.

It should recognize the text properly, but it won't allow you to do any editing. You can, however, correct any text it recognized incorrectly using the "Correct Recognized Text" option in the drop-down.

Thanks.
