Skip to main content
New Participant
November 18, 2019
Question

Apply OCR to only part of a document

  • November 18, 2019
  • 1 reply
  • 6743 views

I have a scanned document that has text as well as charts and diagrams.  When convert the document to text, under Edit PDF, it converts the entire document to text.  This means that my diagrams and charts get converted as well and are nothing like the originals (especially the diagrams).

 

Is there a way to select areas to exclude from being recognized as text?

This topic has been closed for replies.

1 reply

try67
Community Expert
November 18, 2019

No, but running Text Recognition should not change the way the file looks, if you use the Clear Scan option.

New Participant
November 18, 2019

Well it thinks some of my diagrams are charactors when they are not.  How can I prevent this?  Also, what is the "Clear Scan" option?  I'm unfamilar with this.

try67
Community Expert
November 18, 2019

Sorry, I think I got it mixed up. "ClearScan" is the option where it does alter the look of the pages. You should use the "Searchable Image" option to keep the page looking as it was, but add a hidden layer of text to it.

These options are available when you click the Edit button in the Text Recognition dialog.

There's no way to avoid false positives, though, with either method.