Hi all, I read the following page:
https://developer.adobe.com/document-services/docs/overview/pdf-services-api/howtos/ocr-pdf/
It says "Optical character recognition (OCR) converts images to text so that you and your users can fully interact with the PDF file. After performing OCR, the PDF may be fully editable and searchable."
I used a very simple PDF that contains a picture of a mouse and the text "MICE", and I followed the documentation here:
https://developer.adobe.com/document-services/docs/apis/#tag/OCR
to perform OCR on that file. I downloaded the processed PDF and found that the text "MICE" in the generated PDF is now searchable, but it is not editable in Acrobat.
The first page linked above says "the PDF may be fully editable and searchable".
Is it possible to perform OCR so that the text is editable, not just searchable?
Thank you!
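For readers following along, here is a minimal sketch of the JSON body that an OCR request like the one described above might carry. The field names (`assetID`, `ocrLang`, `ocrType`) and the two `ocrType` values are assumptions based on my reading of the OCR section of the API reference linked above, and `build_ocr_request` is a hypothetical helper, not part of any Adobe SDK. If those values are right, both modes are described as "searchable image" output, which may help explain why the result is searchable rather than editable. Please verify everything against the current reference before relying on it.

```python
import json

# Assumed values for the OCR operation's `ocrType` field.
# These are not confirmed by this thread; check the API reference.
ASSUMED_OCR_TYPES = ("searchable_image", "searchable_image_exact")


def build_ocr_request(asset_id: str,
                      ocr_lang: str = "en-US",
                      ocr_type: str = "searchable_image") -> str:
    """Return a JSON request body for an OCR job (hypothetical helper).

    `asset_id` is the identifier of a PDF that was uploaded beforehand.
    The key names are assumptions, not a confirmed schema.
    """
    if not asset_id:
        raise ValueError("asset_id must be a non-empty string")
    if ocr_type not in ASSUMED_OCR_TYPES:
        raise ValueError(f"unsupported ocr_type: {ocr_type!r}")
    body = {
        "assetID": asset_id,
        "ocrLang": ocr_lang,
        "ocrType": ocr_type,
    }
    return json.dumps(body, indent=2)


if __name__ == "__main__":
    # Placeholder asset ID, for illustration only.
    print(build_ocr_request("urn:example:asset-id-placeholder"))
```

The sketch only builds and validates the payload. Authentication, uploading the PDF, and polling for the result are left out on purpose because they depend on your account setup.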
The service can't always create an editable PDF, which is why the description uses the word "may" rather than "will". It depends on the PDF and on how close the text is to other page elements. Can you share your input PDF?
OK, thanks. I think the editability is being thrown off by the white text on the black background.
Thank you, Joel, for taking a look at it. I appreciate it.
Besides the Adobe PDF Services API, there are many other PDF-related APIs, and many of them include an OCR feature. But I can't find any reliable API that makes the text editable. If you know of anything I should try, it would be great if you could let me know.
Thank you again!
You might be able to find one, but I have no specific recommendation. The majority of OCR applications that operate on PDFs are optimized for searching, not editing, so it might be difficult to find one that works for you.
I see. Thank you very much for your help. I appreciate it.