Skip to main content
chucka81557753
Participant
January 18, 2018
Question

Replace or Repair OCR in scanned documents

  • January 18, 2018
  • 1 reply
  • 6891 views

We have about 2000 documents (reports) that were scanned to PDF. These have a text (OCR) layer, but the OCR is very bad, with breaks within most words and complete mis-alignment. The text layer is virtually useless. We have an immediate need to remove the OCR layer and re-create it with a better tool. I do not know the tool that was utilized when these were scanned.

Can Acrobat perform this task? If so, can it do a batch process?

I have looked in the past, and struggled with the fact that the pdf's have a text layer already, so no new OCR is performed.

Thanks in advance for any insights.

Chuck

This topic has been closed for replies.

1 reply

Lovekesh Garg
Adobe Employee
Adobe Employee
March 6, 2018

Please run OCR again on all the documents. It will create a new layer of recognized text. Please use latest Acrobat DC for this as it supports running OCR again on OCRed documents instead of giving an error.

And you can OCR multiple files at a time using "In multiple files" option of Recognize text or by creating an action.

Thanks.

Inspiring
March 29, 2018

Garg,

Thank you for the tip, but it doesn't seem to work on the latest Acrobat Pro DC for Mac (2018.011.20038). It still throws up the error about the PDF already having recognized text.

Is there something I missed?

AnandSri
Community Manager
Community Manager
April 27, 2018

Hello Ibnabouna,

Sorry for the delayed response and inconvenience caused. You may try sanitizing the current PDF file and see if that helps.

To sanitize the PDF, you can refer to the Adobe article Removing sensitive content from PDFs in Adobe Acrobat DC

You may also try to print the PDF through Print to Adobe PDF.

Is it possible to share the PDF file with us? To share the file, please use Adobe Send feature, upload the file, share the link to files via private message only, How Do I Send Private Message

Let us know how it goes and share your findings.

Regards,

Anand Sri.