Skip to main content
Participant
October 9, 2023
Question

PDF to searchable PDF

  • October 9, 2023
  • 2 replies
  • 763 views

I have a requirement to convert pdf to a searchable pdf, where a pdf page contains both text and images with text. therefore i want to extract text from images and add an overlay over the image to maintain the searchability of  text and the images.

 

I'm using .net framework and tried using OCRSupportedType.SEARCHABLE_IMAGE_EXACT but it doesn't extract the text of the image. How to overcome this issue?

    This topic has been closed for replies.

    2 replies

    Raymond Camden
    Community Manager
    Community Manager
    October 11, 2023

    I'm a bit unclear. Are you talking a PDF of scanned images, or a PDF that includes a picture of a cat with a word on it, and you want the word?

    Participant
    October 9, 2023

    Or any other way to automate the Recognize Text function in the tool?