Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

ocr gone after removing hidden information

New Here ,
May 02, 2018 May 02, 2018

Is there a configuration setting that can be applied that will not remove OCR after you redact and remove hidden information/sanitize?  We use Acrobat Pro DC.

I found a reference to use PDF Output ClearScan but that does not appear to be an option in the recognize text general settings.

Or is the solution just to rerun text recognition after the redaction and removing hidden information?

Thanks in advance.

Wendy

2.1K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
1 ACCEPTED SOLUTION
Community Expert ,
May 03, 2018 May 03, 2018

Use the option "Editable Text and Images".

View solution in original post

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
May 03, 2018 May 03, 2018

Use the option "Editable Text and Images".

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 18, 2019 Oct 18, 2019
LATEST

So here's a similar issue and my solution:  Sometimes the OCR isn't as good as I need and when I use the redaction tool it only captures part of word or parts of letters/numbers I need to remove.  Removing Hidden Text has not worked because Adobe still gives me the text cursor (I) not the image (+) which will let me override its failed attempt at OCR'ing.  Using the Preflight flattening has not worked reliably.  My solution is to print the pdf using Microsoft Print to pdf.  It automatically turns the pdf flat and allows me to edit everything on the page.  To use this solution though, the automatic OCR preference must be turned off.  The only annoying downside is you can't use the Redaction tool's search and remove as a first run through to remove everything OCR can catch.  Inevitably, I OCR remove that OCR can catch, then print to pdf and remove everything OCR couldn't read. It shouldn't be that hard to alternate between the text and image tool.  If OCR fails to read something, the program should automatically allow you to use the image function instead. 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines