Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

OCR does not recognize language correctly

Guest
Jun 22, 2016 Jun 22, 2016

We downloaded a trial version of Acrobat DC to see if we could use it to convert docs to PDF/A for records management and archiving purposes.

Some of the documents are just scans, and need OCR first. As we want to add metadata based on content, we have to open and treat each document individually. Acrobat DC immediately starts OCR-conversion without asking, on the assumption that:

- we want the document to be OCR'd (correct)

- it can identify the language of the document itself.

Well that second assumption is wrong: All documents we have tested are identified as being written in Dutch, whereas some are actually in French and even in English (incredible but true). So for every document we have to wait for the first OCR to complete and then have a rerun where we correct the language settings - which is extremely time consuming.

Is there a way to prevent OCR conversion starting automatically and have it run only after defining yourself what the language of the document is?

TOPICS
Acrobat SDK and JavaScript
423
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Jun 23, 2016 Jun 23, 2016
LATEST

Thanks for reporting your concern.

Yes we can prevent it running OCR automatically.

- Go to Acrobat preferences (Ctrl+K or Edit> Preferences)

- Go to Convert to PDF> BMP/TIFF

- Edit Settings> Scan Optimization Settings> Uncheck “Recognize Text” checkbox.

It will disable OCR for all BMP/TIFF files while opening them in Acrobat

Now you can run Text recognition whenever you want with your own settings everytime.

Hope it will resolve your issue. Please feel free to ask anything you want.

Thanks.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines