Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Adobe Acrobat reader pro question: retrieve 100% of all text OCR recognized

New Here ,
Nov 21, 2025 Nov 21, 2025
LS,
I’ve noticed an issue when exporting OCR results from a PDF using Adobe Acrobat Reader Pro: some text appears to be missing in the exported file. The PDF contains both regular text and images with embedded text. While performing a search within Adobe Acrobat Reader Pro (when OCR processing is done), I can find character strings detected in the images that do not appear in the exported text file. So OCR has recognized these characters but they have not been exported.
 
Based on my research, it seems that the only way to extract all characters recognized by Adobe Acrobat’s OCR process is to combine its functionality with a Python script that uses PyMuPDF.
 
Could you please confirm whether this conclusion is correct?
 
Thank you in advance for your assistance.
 
Best regards,
Kees Besse
TOPICS
Edit and convert PDFs , How to , PDF , Scan documents and OCR
57
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Nov 21, 2025 Nov 21, 2025
LATEST

That should not be happening. Can you share a sample file? Also, how exactly are you exporting the text from Acrobat? (not Acrobat Reader Pro... There's no such thing and Reader can't perform OCR)

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines