
converting "image pdf text" to "text text" – for good

New Here, Aug 22, 2016

I have a book-sized collection of PDF documents that are "text as image" (scanned). I am trying to convert them to "text text" so that one can open the resulting PDF document with Reader and have "proper text" in them.

So far, I have found how to edit the text in the document, but that only seems to apply to the page I'm viewing. If I export the document as "text" to check what has been converted from image to text, I only get snippets of the document. I have also found out how to make the image text searchable, but that keeps it as "image".

What is the best way to achieve what I'm trying to do, i.e. export a document of editable text with decent text flow?

TOPICS
Acrobat SDK and JavaScript

1 Correct answer

Community Expert, Aug 22, 2016

In Acrobat you should use the Recognize Text command and select the "Searchable Image" option. That will keep your images as they are, but will insert an invisible layer of text underneath them (if the OCR process is successful, of course), that you could search and export to other formats.
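Once Recognize Text has been run, one way to check that every page actually received a text layer (rather than only the page being viewed) is to inspect the text per page outside Acrobat. The following is only a rough sketch in Python, not an Acrobat feature: it assumes the third-party pypdf package is installed, and the function names, the file name in the usage note, and the 20-character threshold are made up for illustration.

```python
from typing import List


def load_page_texts(pdf_path: str) -> List[str]:
    """Return the extractable text of each page in the PDF.

    Assumption: the third-party pypdf package is available.
    """
    # Imported here so the check below can be used without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    return [(page.extract_text() or "") for page in reader.pages]


def pages_missing_text(page_texts: List[str], min_chars: int = 20) -> List[int]:
    """Return 1-based page numbers with fewer than min_chars non-whitespace characters.

    The threshold is an arbitrary illustrative value, not an Acrobat setting.
    """
    missing = []
    for number, text in enumerate(page_texts, start=1):
        visible = "".join(text.split())  # drop all whitespace before counting
        if len(visible) < min_chars:
            missing.append(number)
    return missing
```

For example, `pages_missing_text(load_page_texts("scanned_book.pdf"))` would list the pages that probably still need OCR; a blank or image-only page will also show up in that list, so the result is a hint rather than proof.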
