July 28, 2016
Question

OCR a Newer Scanned Document that looks like a jpeg.

  • July 28, 2016
  • 1 reply
  • 838 views

Hi There,

At work, we receive scanned documents from banks. Previously they came in black and white, and we could use the "Enhance Scans" feature to either recognize the text or convert them to PDFs. We transcribe a lot of the data, so we rely on being able to convert a document to PDF, or to enhance the pages and grab the text. We switched banks, and the documents we get now appear to be produced with newer technology. They are all in color, and Adobe picks them up as a JPEG or picture and can't recognize them as renderable documents. This greatly increases our workload because we can't run any OCR on these documents. Is there a way to get them into a state where we can use "Enhance Scans", or to make them convertible to Excel?



Lovekesh Garg
Adobe Employee
July 29, 2016

Hi Chris,

Sorry for the issue you are facing. Could you please provide the following information to help us identify and resolve it as soon as possible:

- Acrobat version (Help>About Acrobat)

- OS version

- Are you getting any error message (such as "This page contains renderable text")?

- Can you please share a sample document that shows this issue? (You can use https://cloud.acrobat.com/send )

Thanks.

July 29, 2016

Yes I can!

- Adobe Acrobat DC

- Windows 10

- "Acrobat could not perform Text Recognition on this page because

     The page contains renderable Text"

- Shared Files - Acrobat.com

I appreciate the help!

Lovekesh Garg
Adobe Employee
August 1, 2016

The issue is that the file you are using contains some live text, so Acrobat stops before running further OCR to avoid affecting the existing text.

That's why you are getting this error message. For now, this is expected behavior for this kind of file.

For the time being, there is one workaround:

- Open the file and go to File > Export To > Image > JPEG

- This saves your PDF as images

- Now open all of these image files in Acrobat again

- Acrobat will ask you to combine the images into one PDF (if more than one file is selected)

- Now run OCR again

- This time OCR will run on the complete page
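
If you have many files and want to script the same idea outside Acrobat, here is a rough sketch. It is not an Acrobat feature: it assumes the open-source tools `pdftoppm` (from Poppler) and `tesseract` are installed and on your PATH, and it swaps them in for the Export and OCR steps above. The function names, the `page` file prefix, and the 300 dpi default are only illustrative choices.

```python
import subprocess
from pathlib import Path


def rasterize_cmd(pdf_path, out_prefix, dpi=300):
    # pdftoppm renders every page to a JPEG, which flattens any live text
    # into pixels (the same effect as exporting the PDF to images).
    return ["pdftoppm", "-r", str(dpi), "-jpeg", str(pdf_path), str(out_prefix)]


def ocr_cmd(image_path):
    # "tesseract <image> <output base> pdf" writes <output base>.pdf,
    # a searchable PDF with a text layer behind the page image.
    image_path = Path(image_path)
    return ["tesseract", str(image_path), str(image_path.with_suffix("")), "pdf"]


def run_workaround(pdf_path, work_dir):
    # Assumption: both tools are installed; check=True raises if one fails.
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    subprocess.run(rasterize_cmd(pdf_path, work_dir / "page"), check=True)

    outputs = []
    for image in sorted(work_dir.glob("page-*.jpg")):
        subprocess.run(ocr_cmd(image), check=True)
        outputs.append(image.with_suffix(".pdf"))
    return outputs  # one searchable PDF per page
```

You would call it with something like `run_workaround("statement.pdf", "ocr_out")` (both names are placeholders). It leaves one searchable PDF per page in the working folder, so you would still need to combine them afterwards if you want a single file.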

This adds some overhead, since extra steps are involved, but it will resolve your issue. It is the only workaround that currently exists, though not for much longer, as we are working on handling this case in the near future.

Please feel free to ask anything you want.

Thanks,

Lovekesh Garg.