Replace or Repair OCR in scanned documents

Forum|Forum|8 years ago
January 18, 2018
1 reply
6973 views

We have about 2000 documents (reports) that were scanned to PDF. These have a text (OCR) layer, but the OCR is very bad, with breaks within most words and complete mis-alignment. The text layer is virtually useless. We have an immediate need to remove the OCR layer and re-create it with a better tool. I do not know the tool that was utilized when these were scanned.

Can Acrobat perform this task? If so, can it do a batch process?

I have looked in the past, and struggled with the fact that the pdf's have a text layer already, so no new OCR is performed.

Thanks in advance for any insights.

Chuck

This topic has been closed for replies.

Lovekesh Garg

Adobe Employee

Please run OCR again on all the documents. It will create a new layer of recognized text. Please use latest Acrobat DC for this as it supports running OCR again on OCRed documents instead of giving an error.

And you can OCR multiple files at a time using "In multiple files" option of Recognize text or by creating an action.

Thanks.

I

ibnabouna

Inspiring

Garg,

Thank you for the tip, but it doesn't seem to work on the latest Acrobat Pro DC for Mac (2018.011.20038). It still throws up the error about the PDF already having recognized text.

Is there something I missed?

AnandSri

Legend

Hello Ibnabouna,

Sorry for the delayed response and inconvenience caused. You may try sanitizing the current PDF file and see if that helps.

To sanitize the PDF, you can refer to the Adobe article Removing sensitive content from PDFs in Adobe Acrobat DC

You may also try to print the PDF through Print to Adobe PDF.

Is it possible to share the PDF file with us? To share the file, please use Adobe Send feature, upload the file, share the link to files via private message only, How Do I Send Private Message

Let us know how it goes and share your findings.

Regards,

Anand Sri.

Sign up

To post, reply, or follow discussions, please sign in with your Adobe ID.

Sign in to Adobe Community

To post, reply, or follow discussions, please sign in with your Adobe ID.

Scanning file for viruses.

This file cannot be downloaded