Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Compressing a scanned file for size (not to recognize text) resulted in contents full of errors

New Here ,
Jan 29, 2025 Jan 29, 2025

I used my Adobe Acrobat Pro account to compress a large scanned 98MB PDF file for size, not to recognize text, and the resulting compressed file is full or errors. It seems the fonts are all messed up with certain letters being replaced with white space (kind of like in the case of redacting), and so the contents are mostly ineligeble in this compressed file. It seems to me like it's reducing its size by literally deleting characters, which seems kind of absurd. What is going on that is causing this? I have not intended to or need to run text recognition on the file as I realize it is not the best quality text and there is handwritten notes throughout sporadically. How can I get a smaller size file without altering the contents (vs. reduction in clarity quality, which I am okay with!)?

 

Thanks!
-Yasmeen

TOPICS
How to , Manage files
374
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Adobe Employee , Feb 19, 2025 Feb 19, 2025

Hello,

 

I hope you're doing well, and we apologize for the delayed response and the trouble.

 

Is it specific to one file or is it with all the PDFs? When compressing scanned PDF files in Adobe Acrobat without performing Optical Character Recognition (OCR), it's essential to adjust specific settings to prevent content errors.

Use the 'Optimize Scanned PDF' Feature:

  • Open your scanned PDF in Adobe Acrobat.
  • Navigate to Menu/Tools > Optimize PDF.
  • In the toolbar, select Optimize Scanned PDF.
  • In the
...
Translate
Adobe Employee ,
Feb 19, 2025 Feb 19, 2025
LATEST

Hello,

 

I hope you're doing well, and we apologize for the delayed response and the trouble.

 

Is it specific to one file or is it with all the PDFs? When compressing scanned PDF files in Adobe Acrobat without performing Optical Character Recognition (OCR), it's essential to adjust specific settings to prevent content errors.

Use the 'Optimize Scanned PDF' Feature:

  • Open your scanned PDF in Adobe Acrobat.
  • Navigate to Menu/Tools > Optimize PDF.
  • In the toolbar, select Optimize Scanned PDF.
  • In the dialog box, ensure that Adaptive Compression is enabled. This setting helps reduce file size while maintaining content integrity. For detailed guidance, refer to this article.

Disable OCR During Optimization:

  • Within the same Optimize Scanned PDF dialog, look for the Recognize Text option.
  • Uncheck the Recognize Text option to prevent Acrobat from applying OCR during compression.

Check for Font Issues: Ensure the fonts used in the scanned document are correctly embedded. Sometimes, font issues can cause errors in the compressed file. Adobe provides guidance on how to manage fonts in PDFs: Fonts in PDFs. See this article for detailed information.

 

If you own the PDF and can access the source file, try recreating it using Acrobat, ensuring the fonts are correctly embedded.

 

Let us know how it goes.

Thanks,

Anand Sri.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines