Skip to main content
Participant
June 8, 2020
Question

Memory leak (which leads to out of memory) when doing OCR

  • June 8, 2020
  • 1 reply
  • 6251 views

Environment: Acrobat Pro DC version 2020.009.20063. Windows 10 version 1909 (18363.778). Total 16GB of physical RAM. Locale zh/cn.

 

I used Acrobat Pro DC to do an OCR for a scanned textbook (~500 pages, 250MB). When it progressed to ~Page 90, following 3 consequtive error dialogs were shown, and then the recognition stopped:

- "unknown error"

- "unable to locate the paper capture recognition service"

- "out of memory"

(Text may be inaccurate because the original dialog is shown in Chinese. See attached screenshots for original text.)

Task Manager showed that it used 3.5GB of RAM, the maximum value for a 32-bit program.

 

I found a workaround for this problem, that is to recognize only 80 pages, save result and then restart Acrobat before it runs out of memory. However this is very time-consuming so I hope it will be fixed.

 

I have attached sample.pdf (repeated 120 pages of TOC from the book) to reproduce this issue. Use OCR option "Chinese (simplified)", "searchable image", "300 dpi", and it will OOM at Page 91.

This topic has been closed for replies.

1 reply

Amal.
Community Manager
Community Manager
June 8, 2020

Hi there

 

We are sorry for the trouble. We tried to reproduce the issue on our end and its working fine.

 

Please update the application to the new version 20.009.20067  and see if that works for you. Go to Help > Check for Updates

 

You may also try to repair the installation. Go to Help > Repair Installation and see if that makes any difference.

 

Regards

Amal

Participant
June 9, 2020

Hallo,

 

wir haben hier seit einigen Tagen genau dieses Problem! Version ist aktuell = 20.009.20067

Ich konnte mit der Beispiel-Datei den OCR-Abbruch reproduzieren.

 

Viele Grüße

Participant
June 12, 2020

Anbei ein Video, welches das Problem darstellt.

siehe = OCR_Problem_2020-06-09 


This is not unique. Our organisation has over 20 Acrobat DC Pro licences on W10 and since this "latest update" to 20.009.20067, any attempt to do OCR crashes out about page 80-90 (of a 450 page document) with Out of Memory and Acrobat is using 3.6Gb of RAM on an 8Gb RAM setup.


Adobe - this is a bug which needs fixed. I have an old Acrobat 8 Standard installation on W10 and the 450 page document was OCR'd wthout error - and with Acrobat 8 Std using no more than 220Mb RAM.


Acrobat 8 has no "batch" OCR facility and I have about 100 documents of 400-500 pages to OCR. The idea of opening each one and setting OCR manually on each one is awful. Prior to this issue, I would set them all to OCR over a weekend.

PLEASE PLEASE can this be fixed soon instead of in 12 months time!! I am a Head of IT and it is 100% an Acrobat DC memory leak, not your usual "check version", "check updates", "restart workstation" fix.