Copy link to clipboard
Copied
Hello,
I am using the "PDF Services Java SDK" (https://github.com/adobe/pdfservices-java-sdk-samples) to create a DOCX from a PDF document.
It mostly works, but some elements are transformed to text automatically (OCR) when it shouldn't.
For instance, we have a logo in the top left corner:
And sometimes, Adobe Services API tries to use OCR on it, which results in garbage characters:
Same goes for images of maps, which contain texts (texts, numbers) that the Adobe API messes up:
In the API, I don't see any options to prevent OCR on some elements or even to prevent OCR altogether.
The only option is the ability to pass a com.adobe.pdfservices.operation.pdfops.options.exportpdf.ExportPDFOptions objects, on which you can only set the preferred language for OCR:
Is there any way to prevent this behaviour ?
Thanks,
Fabien
Have something to add?