Participating Frequently
July 29, 2024
Question

Issues with OCR PDF conversion

I need to convert PDFs to DOCX. When I convert my OCR'd PDF to DOCX, I run into two problems. First, the handwritten characters are excluded from the OCR, so those regions come out garbled. Second, regions with an ash-coloured (grey) background also come out garbled, as in the images below.


Could you please help me overcome these problems? Alternatively, is there a paid API that supports this?

    1 reply

    Raymond Camden
    Community Manager
    July 29, 2024

    It may, in this case, simply be that the OCR wasn't able to handle that handwriting. If you open the PDF in Acrobat and OCR it there, does it work?

    Participating Frequently
    July 30, 2024

    Sorry, I didn't check in Adobe Acrobat.

    Participating Frequently
    July 30, 2024

    I have now tried it in Adobe Acrobat. When I apply OCR to a PDF, it adds a digitized layer over the handwritten regions. Then, when I use Export PDF to convert the PDF to DOCX, those regions get garbled as above.

    Can you help me understand whether there is a customized solution to exclude handwritten regions when applying OCR to a PDF, so that the handwritten regions remain intact?
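
To make the request above concrete, here is a minimal sketch of what "excluding handwritten regions" could look like as a post-processing step. It is not an Adobe feature or API. It assumes the OCR engine can report each recognized word with a bounding box, and that the handwritten regions are already known as rectangles (marked by hand or found by some separate detector). `Box`, `Word`, `split_words`, and `min_overlap` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle in page coordinates (assumed units: points)."""
    x0: float
    y0: float
    x1: float
    y1: float

    def area(self) -> float:
        return max(0.0, self.x1 - self.x0) * max(0.0, self.y1 - self.y0)

    def overlap(self, other: "Box") -> float:
        """Area of the intersection with another box (0.0 if disjoint)."""
        w = min(self.x1, other.x1) - max(self.x0, other.x0)
        h = min(self.y1, other.y1) - max(self.y0, other.y0)
        return max(0.0, w) * max(0.0, h)


@dataclass(frozen=True)
class Word:
    """One recognized word plus its bounding box (hypothetical OCR output)."""
    text: str
    box: Box


def split_words(words, handwritten_regions, min_overlap=0.5):
    """Separate OCR words into (kept, dropped).

    A word is dropped when at least `min_overlap` of its area lies inside
    any single handwritten region; everything else is kept as OCR text.
    """
    kept, dropped = [], []
    for word in words:
        area = word.box.area()
        covered = max(
            (word.box.overlap(region) for region in handwritten_regions),
            default=0.0,
        )
        if area > 0 and covered / area >= min_overlap:
            dropped.append(word)
        else:
            kept.append(word)
    return kept, dropped


# Illustrative data only: a printed label, a handwritten value, a printed label.
words = [
    Word("Name:", Box(10, 10, 60, 22)),
    Word("J0hn", Box(70, 8, 130, 26)),   # misread handwriting
    Word("Date", Box(10, 40, 50, 52)),
]
handwritten_regions = [Box(65, 0, 200, 30)]

kept, dropped = split_words(words, handwritten_regions)
print([w.text for w in kept])     # ['Name:', 'Date']
print([w.text for w in dropped])  # ['J0hn']
```

In a workflow like this, the dropped regions would be left as untouched image areas in the output, so the handwriting stays intact instead of being replaced by misrecognized text. Whether a given export tool allows that is a separate question.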