Copy link to clipboard
Copied
We are having trouble completing OCR on documents we receive from the district attorney in discovery. I am an attorney. I have Adobe Acrobat 10 Pro. I went to Adobe's article, Fix the OCR error Could Not Perform Recognition in Acrobat. The solutions suggested there may be unworkable.
Solution 1 is to get a copy of the PDF in non-renderable text. I have a call into the district attorney to see if that is possible. I am very skeptical that the DA will want to change its procedures.
Solution 2 is to convert the PDF into a TIFF and then back again. While that works, it is not practical for the 8,000 pages of discovery we receive in a case. The problem is that the conversion from PDF to TIFF creates a separate document for each page. Those have to be recombined. That would take weeks of work, which is not practical.
I just want to OCR renderable text and end up with a document that I can search thoroughly.
Hi, did you try "Remove Hidden Information" on 'Protection' menu?
This function removes all rendering text (remember to click on "Remove" button), so you can perform OCR recognition.
I hope this help you.
Copy link to clipboard
Copied
Hi Matthew,
Sorry for the issue you are facing. 2nd solution is the only workaround available as of now. This is happening because your document is partially OCRed. And we are not running OCR on it again to avoid any loss of data.
For some kind of files we already resolve this issue in latest Acrobat DC, and for other we are working on, to get a better solution.
Please try Adobe Acrobat DC trial version if it resolve your issue Download Adobe Acrobat free trial | Acrobat Pro DC
If not, you can do 1 thing to fasten the 2nd solution:
- First export PDF to TIFF images in a folder
- Than select all images and drag them to Acrobat
- It will ask you to combine all images and run OCR. Click OK on the dialog comes for this
Now it won't take as much time as running OCR on an image individually.
Hope it will resolve your issue as of now. We will update you once the permanent solution is available for this problem.
Thanks.
Copy link to clipboard
Copied
Hi, did you try "Remove Hidden Information" on 'Protection' menu?
This function removes all rendering text (remember to click on "Remove" button), so you can perform OCR recognition.
I hope this help you.
Copy link to clipboard
Copied
Thanks. Yes, that worked.
Matt Werner
[private info deleted by host]
Copy link to clipboard
Copied
With the latest release of Acrobat DC on 11th April 2017, the issue of error "Page contains renderable text" has been resolved. Go to What's new in Adobe Acrobat DC for more details.
To get the latest product update, click on the menu Help-> Check for updates
Thanks.
Find more inspiration, events, and resources on the new Adobe Community
Explore Now