Fix the OCR error Could Not Perform Recognition in Acrobat

New Here, Aug 23, 2016

We are having trouble completing OCR on documents we receive from the district attorney in discovery.  I am an attorney.  I have Adobe Acrobat 10 Pro.  I went to Adobe's article, Fix the OCR error Could Not Perform Recognition in Acrobat.  The solutions suggested there may be unworkable. 

Solution 1 is to get a copy of the PDF without renderable text.  I have a call in to the district attorney to see if that is possible.  I am very skeptical that the DA will want to change its procedures.

Solution 2 is to convert the PDF into a TIFF and then back again.  While that works, it is not practical for the 8,000 pages of discovery we receive in a case.  The problem is that the conversion from PDF to TIFF creates a separate document for each page.  Those have to be recombined.  That would take weeks of work, which is not practical. 

I just want to OCR renderable text and end up with a document that I can search thoroughly. 

Topics: Acrobat SDK and JavaScript, Windows

Adobe Employee, Aug 23, 2016

Hi Matthew,

Sorry for the issue you are facing. The second solution is the only workaround available as of now. This happens because your document is already partially OCRed, and Acrobat does not run OCR on it again, to avoid any loss of data.

For some kinds of files we have already resolved this issue in the latest Acrobat DC, and for the others we are still working on a better solution.

Please try the Adobe Acrobat DC trial version to see whether it resolves your issue: Download Adobe Acrobat free trial | Acrobat Pro DC

If not, there is one thing you can do to speed up the second solution:

- First, export the PDF to TIFF images in a folder.

- Then select all the images and drag them into Acrobat.

- Acrobat will ask whether to combine all the images and run OCR. Click OK in the dialog that appears.

This takes much less time than running OCR on each image individually.
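When thousands of exported page images are recombined, file order matters: a plain alphabetical sort puts page 10 before page 2. Below is a minimal Python sketch that works out zero-padded names, so alphabetical order matches page order before the images are dragged into Acrobat. It assumes the export produced names like `discovery_Page_1.tiff`. That naming pattern and the helpers `page_number` and `padded_names` are assumptions for illustration, not Acrobat's documented behavior.

```python
import re
from pathlib import Path


def page_number(name: str) -> int:
    """Return the last run of digits in the file stem (assumed to be the page number)."""
    digits = re.findall(r"\d+", Path(name).stem)
    if not digits:
        raise ValueError(f"no page number found in {name!r}")
    return int(digits[-1])


def padded_names(names: list[str], width: int = 5) -> dict[str, str]:
    """Map each original file name to a zero-padded name, in numeric page order."""
    mapping = {}
    for name in sorted(names, key=page_number):
        suffix = Path(name).suffix
        mapping[name] = f"page_{page_number(name):0{width}d}{suffix}"
    return mapping


if __name__ == "__main__":
    # Hypothetical file names; a real run would list the export folder instead.
    sample = ["discovery_Page_10.tiff", "discovery_Page_2.tiff", "discovery_Page_1.tiff"]
    for old, new in padded_names(sample).items():
        print(old, "->", new)
```

The sketch only computes the new names. The actual rename (for example with `Path.rename`) is left out so that nothing on disk changes until the mapping has been checked.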

I hope this resolves your issue for now. We will update you once a permanent solution is available for this problem.

Thanks.

Guest, Aug 26, 2016 (Correct answer)

Hi, did you try "Remove Hidden Information" on the "Protection" menu?

This function removes all renderable text (remember to click the "Remove" button), so that you can then run OCR.

I hope this helps.

New Here, Aug 26, 2016

Thanks.  Yes, that worked.

Matt Werner

[private info deleted by host]

Adobe Employee, Apr 19, 2017

With the latest release of Acrobat DC on April 11, 2017, the "Page contains renderable text" error has been resolved. See What's new in Adobe Acrobat DC for more details.

To get the latest product update, choose Help > Check for Updates.

Thanks.
