• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit Search
0

scan documents and ocr

New Here ,
Jan 17, 2023 Jan 17, 2023

Copy link to clipboard

Copied

After I covert a large pdf (1600 pp) to OCR and save, when I scroll through each page seems to reset OCR and then suddenly it seems OCR goes away entirely-leaving no pages converted. What can I do? Thanks - any help would be appreciated!

TOPICS
Scan documents and OCR

Views

153

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 17, 2023 Jan 17, 2023

Copy link to clipboard

Copied

I don't personally know of any particular limit to number of pages (although there was a dimension limit at one time of 45" x 45"), but I suspect that trying such a large document might be stressing your Acrobat/system's memory and silently "quitting". Have you tried breaking the document into smaller chunks, OCRing them, and then recombining? Not ideal, but might be the only answer at this time.

 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 17, 2023 Jan 17, 2023

Copy link to clipboard

Copied

LATEST

Chiming in with Brad @ Roaring Mouse.

You might be maxing out your RAM and VRAM. 1600 pages of scanned and OCR text and graphics is a helluva lotta data to process. The scanned page never goes away: it's always still in the PDF file as a graphic along with the OCR'ed recognized text.

 

You could try breaking up the file as Brad suggests.

Or you could try reducing the scanned resolution. By default it's set to 600 dpi (Recognize Text / Settings / Resolution). Maybe a slight reduction to 400 or even 300 dpi might lighten the digital load on your computer. But note that lower resolutions might not capture the text clearly enough to differentiate between e and o and other commonly mis-identified characters. Depends upon how clear the original paper version is.

 

Bevi Chagnon | PubCom | Designer & Technologist for Accessible Documents
| Books & Classes | Accessible InDesign | Accessible PDFs | Accessible MS Office |

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines