
How to OCR a two-column document (textbook) without the text from the two columns being combined

New Here, Jul 07, 2021

Hello, I am scanning pages from a textbook that is set up as two columns (you read the page all the way down the left column, then all the way down the right column) and OCRing each page using Adobe Acrobat Pro DC. After OCR, the text is editable, but the text from the left and right columns is combined and intermixed, making the page illegible, so you have to sift through the text to extract it in the order you want. Is there a way to OCR the page so that all of the left column is OCRed first and then all of the right column? I don't see any option to select "zonal" OCR (the terminology may not be correct) in the program. Please advise. Thank you.

TOPICS
How to, Scan documents and OCR
Community Expert, Jul 07, 2021

Not possible.

Community Expert, Jul 07, 2021

As Bernd said, this is not possible IN Acrobat.

But it is possible, depending on how you scan the page.

First off, forget about doing the scanning from within Acrobat. Just find the software that came with your scanner and run that directly.

Some scanners can scan two different regions sequentially (as if you had two or more photographs on the glass). If yours can do that, great. Otherwise, you just have to run the scanning process twice (or more, depending on the original page's layout).

Set your scanner to save each scan in a convenient place (Desktop?) in the TIF format. After you have both columns scanned, then open each of these files in Acrobat. Since they are in the TIF format, they should both be converted to PDF and then OCRed automatically. Once that is complete, you can copy out the text or export it as a Word document.
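
If you already have full-page scans and would rather not rescan, the same idea can be approximated by cropping each page image into one file per column before opening the pieces in Acrobat. What follows is only a rough sketch, not something Acrobat or your scanner software provides. It assumes Python with the third-party Pillow library, a straight scan, and equal-width columns. The names `column_boxes` and `split_page` and the `gutter` parameter are made up for illustration.

```python
from pathlib import Path


def column_boxes(width, height, columns=2, gutter=0):
    """Return one (left, top, right, bottom) crop box per column, left to right.

    Assumes equal-width columns. `gutter` is the number of pixels trimmed
    from each side of every interior column boundary.
    """
    if columns < 1:
        raise ValueError("columns must be at least 1")
    boxes = []
    for i in range(columns):
        left = width * i // columns
        right = width * (i + 1) // columns
        if i > 0:
            left += gutter    # trim the gutter on the left of inner columns
        if i < columns - 1:
            right -= gutter   # trim the gutter on the right of inner columns
        boxes.append((left, 0, right, height))
    return boxes


def split_page(scan_path, out_dir, columns=2, gutter=0):
    """Crop a full-page scan into one TIFF per column, numbered in reading order."""
    from PIL import Image  # third-party dependency: pip install Pillow

    scan_path, out_dir = Path(scan_path), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with Image.open(scan_path) as page:
        boxes = column_boxes(page.width, page.height, columns, gutter)
        for n, box in enumerate(boxes, start=1):
            out = out_dir / f"{scan_path.stem}_col{n}.tif"
            page.crop(box).save(out, format="TIFF")
            written.append(out)
    return written
```

For example, `split_page("page_017.tif", "columns")` (hypothetical file names) would write `page_017_col1.tif` and `page_017_col2.tif`. Open them in Acrobat in that order so the left column is OCRed before the right one. If the columns on your pages are not the same width, the crop boxes would need to be set by hand instead.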

Yes, this is more tedious than if it could be done within Acrobat, but it's better than trying to match up the two halves as you've been attempting.

As long as I have your attention, here's a blog I wrote for Adobe on how to get better scanning results.

http://photosbycoyne.com/Gary's_Help/Scanning/clean-scanning.html

Good luck!
