ocr gone after removing hidden information

Report · May 02, 2018

Is there a configuration setting that can be applied that will not remove OCR after you redact and remove hidden information/sanitize? We use Acrobat Pro DC.

I found a reference to use PDF Output ClearScan but that does not appear to be an option in the recognize text general settings.

Or is the solution just to rerun text recognition after the redaction and removing hidden information?

Thanks in advance.

Wendy

Report · May 03, 2018

Use the option "Editable Text and Images".

View solution in original post

Report · May 03, 2018

Use the option "Editable Text and Images".

Report · Oct 18, 2019

So here's a similar issue and my solution: Sometimes the OCR isn't as good as I need and when I use the redaction tool it only captures part of word or parts of letters/numbers I need to remove. Removing Hidden Text has not worked because Adobe still gives me the text cursor (I) not the image (+) which will let me override its failed attempt at OCR'ing. Using the Preflight flattening has not worked reliably. My solution is to print the pdf using Microsoft Print to pdf. It automatically turns the pdf flat and allows me to edit everything on the page. To use this solution though, the automatic OCR preference must be turned off. The only annoying downside is you can't use the Redaction tool's search and remove as a first run through to remove everything OCR can catch. Inevitably, I OCR remove that OCR can catch, then print to pdf and remove everything OCR couldn't read. It shouldn't be that hard to alternate between the text and image tool. If OCR fails to read something, the program should automatically allow you to use the image function instead.