How do I OCR a page that already has searchable text?

Explorer, Jan 19, 2016

The wonders of Adobe never cease. Apparently, Acrobat won't OCR a page if even one little word on the page is already searchable. Other programs that work with PDFs don't have this issue. I constantly marvel at how poorly Adobe handles the PDFs its own software creates. Does anyone know a way to make Acrobat OCR the rest of the page?

TOPICS
Acrobat SDK and JavaScript
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Explorer, Jan 19, 2016

The error I get is:

"Acrobat could not perform recognition (OCR) on this page because:

This page contains renderable text."

It must be some pure genius on Adobe's part, which I don't understand, to make things so difficult.

Community Expert, Jan 20, 2016

Save the page as a TIFF file. Create a PDF from the TIFF file. Perform OCR on the new PDF.
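
For anyone who wants to script the same rasterize-then-OCR round trip outside Acrobat, here is a minimal sketch. It is not Acrobat's own method: it assumes the Ghostscript (`gs`) and Tesseract (`tesseract`) command-line tools are installed, and the function names, file names, and the 300 DPI default are placeholders chosen for illustration.

```python
import subprocess


def rasterize_cmd(pdf_path, page, tiff_path, dpi=300):
    """Build a Ghostscript command that renders one page to a 24-bit RGB TIFF."""
    return [
        "gs", "-dNOPAUSE", "-dBATCH", "-dSAFER",
        "-sDEVICE=tiff24nc",           # uncompressed 24-bit RGB TIFF output
        f"-r{dpi}",                    # render resolution in dots per inch
        f"-dFirstPage={page}",         # render only the requested page
        f"-dLastPage={page}",
        f"-sOutputFile={tiff_path}",
        pdf_path,
    ]


def ocr_cmd(tiff_path, output_base, language="eng"):
    """Build a Tesseract command that writes a searchable PDF to <output_base>.pdf."""
    return ["tesseract", tiff_path, output_base, "-l", language, "pdf"]


def reocr_page(pdf_path, page, output_base, dpi=300):
    """Flatten one page to an image, then OCR it into a new searchable PDF."""
    tiff_path = f"{output_base}.tif"
    subprocess.run(rasterize_cmd(pdf_path, page, tiff_path, dpi), check=True)
    subprocess.run(ocr_cmd(tiff_path, output_base, language="eng"), check=True)
    return f"{output_base}.pdf"
```

Calling `reocr_page("scan.pdf", 1, "page1")` would, under those assumptions, leave a searchable `page1.pdf` next to the intermediate `page1.tif`. The resolution passed to the rasterizer largely determines how much detail survives the round trip.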

Explorer, Jan 20, 2016

Hi.  Exporting it to TIFF and back to PDF dramatically worsens the quality.  Is there a way instead, perhaps, to flatten the page so that Adobe no longer considers it rendered text?

This is pathetic of Adobe.

Community Expert, Jan 20, 2016

Flatten it with Save As TIFF.

Explorer, Jan 20, 2016

You're an Adobe Certified Professional? Saving to TIFF and converting back to PDF severely reduces quality. I have a better idea: why don't you get your company to handle OCRing pages that already have renderable text? Clearly it can be done, since products like Nuance's OmniPage can handle it with no difficulty whatsoever. Adobe is inferior at working with a format it invented, yet still charges plenty for its defective product.

Explorer, Jan 20, 2016

On top of that, converting to TIFF blows up the file size, wasting space and straining the hard drive.
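
The size growth follows from simple arithmetic. As a rough sketch, assuming a US Letter page (612 × 792 points, at 72 points per inch), 24-bit RGB pixels, and no compression (none of these figures come from the thread; the function name is made up for illustration):

```python
def raster_size(width_pt, height_pt, dpi, bytes_per_pixel=3):
    """Return (width_px, height_px, uncompressed_bytes) for a page rendered at dpi."""
    # PDF page dimensions are in points; one point is 1/72 inch.
    width_px = round(width_pt * dpi / 72)
    height_px = round(height_pt * dpi / 72)
    return width_px, height_px, width_px * height_px * bytes_per_pixel


for dpi in (150, 300, 600):
    w, h, size = raster_size(612, 792, dpi)
    print(f"{dpi} DPI: {w} x {h} px, {size / 1e6:.1f} MB uncompressed")
# 150 DPI: 1275 x 1650 px, 6.3 MB uncompressed
# 300 DPI: 2550 x 3300 px, 25.2 MB uncompressed
# 600 DPI: 5100 x 6600 px, 101.0 MB uncompressed
```

These are upper bounds per page. TIFF can also hold compressed data (for example LZW, or CCITT Group 4 for black-and-white scans), so real files are often smaller, but doubling the resolution still quadruples the pixel count.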

LEGEND, Jan 20, 2016

"ACP" = Adobe Community Professional.

Good to know - PDF has been an ISO Standard for some time now.

Just nattering, but the phrase "The lady doth protest too much, methinks" comes to mind.

Be well
