Skip to main content
KeithHo
Participant
August 4, 2023
Question

identifying whether a PDF has already been OCR'ed

  • August 4, 2023
  • 1 reply
  • 3564 views

When you use Adobe to OCR a PDF file, does that operation embed any particular codes or characters in the file that would enable me to identify whether the PDF file has already been OCR'ed?

I am trying to do a search in Explorer to identify all PDF files that have or have not been OCR'ed.

This topic has been closed for replies.

1 reply

AkanchhaS8194121
Legend
August 4, 2023

Hi @KeithHo 

 

Hope you are doing well.

When you use Adobe to OCR a PDF file, does that operation embed any particular codes or characters in the file that would enable me to identify whether the PDF file has already been OCR'ed?

 

If you can highlight the text with your cursor, it's recognized. If you cannot highlight the text, it is part of the image and is not recognized by assistive tools.

But this can be done only when you open and work with the file.

 

I am trying to do a search in Explorer to identify all PDF files that have or have not been OCR'ed.

 

There's no mechanism to identify the document's OCR status from file explorer. 

 

 

Thanks,

Akanchha