Skip to main content
qianglu
Participant
March 17, 2017
Answered

How to OCR Tibetan in Adobe Acrobat Professional

  • March 17, 2017
  • 1 reply
  • 2519 views

I searched online and found an answer in this link The Tibetan and Himalayan Library..

This is the process for running OCR on a PDF so that it is searchable, using Acrobat Professional:

  1. For most PDFs, you want to run Optimize after you scan them. First rename the file; then pull down the Document menu and select Optimize.
  2. Then, to run OCR: open the PDF file you want to run OCR on.
  3. Pull down the File menu, choose "Save as," and add "-ocr.pdf" to the file name
  4. Pull down the Document menu, point to "OCR Text Recognition," and then point to "Recognize Text Using OCR…" and "start"
  5. The OCR process will start. It will take some time, depending on the number of pages in the PDF.
  6. When it finishes, save the file. Be sure to check by doing a search on "the" or another word in the file and make sure it returns results.

To OCR roman text with diacritic characters, investigate using Abbyy's FineReader (http://www.abbyy.com/). No THL staff have used this and we have no experience with it. For more information, see Zach Rowinski's assesssment.

Read more:  http://www.thlib.org/tools/wiki/How%20to%20OCR%20a%20PDF.html#ixzz4bcSy4Ql1

However, I couldn't even find the Document manu in my Acrobat for mac. I wonder what version of the Acrobat that could have Document manu and that can recognize Tibetan as it described in the above link?

Qiang Lu

This topic has been closed for replies.
Correct answer Lovekesh Garg

Sorry to say but OCR in Tibetan is not supported for Acrobat. I will raise a feature request for you. Team will look into this and update you if support for Tibetan is added in Acrobat.

Thanks.

1 reply

Participating Frequently
March 17, 2017

Hi Qianglu,

Sadly, the application is trying to perform OCR, however converting Tibetan script is not an option for the OCR engine.

Also, please share the application and operating system versions you are using and elaborate which document menu are you referring to, to help us understand the issue better.

Thanks,
Supriya

Lovekesh Garg
Adobe Employee
Lovekesh GargCorrect answer
Adobe Employee
March 20, 2017

Sorry to say but OCR in Tibetan is not supported for Acrobat. I will raise a feature request for you. Team will look into this and update you if support for Tibetan is added in Acrobat.

Thanks.