• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers

Convert a PDF image file to .txt doesn't output any text.

New Here ,
May 10, 2021 May 10, 2021

Copy link to clipboard

Copied

Hi Community, 

It seems that the export of PDF to txt doesn't launch OCR recognition. When converting the attached document to .txt, no text gets recognized.

However converting to DOCX or HTML outputs the recognition text. Please find the script to reproduce in enclosure. 

 

Is it expected? Any suggestion to trigger OCR when converting to .txt would be highly appreciated.

 

Thanks & regards,

Simon

TOPICS
Acrobat SDK and JavaScript , Windows

Views

183

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Community Expert , May 11, 2021 May 11, 2021

When you export as text Acrobat doesn't perform OCR on the document.

Export as Word or HTML has this option.

Likes

Translate

Translate
Community Expert ,
May 10, 2021 May 10, 2021

Copy link to clipboard

Copied

Why does you use "com.adobe.acrobat.accesstext" ?

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
May 10, 2021 May 10, 2021

Copy link to clipboard

Copied

Hmm I put it to experiment. I guess "com.adobe.acrobat.plain-text" is the most apropriated. I'm not actually sure about what does "accesstext" means...

However "com.adobe.acrobat.plain-text" still doesn't trigger OCR and outputs a 6 bytes .txt file, containing the UTF16 BOM + 2 space characters. 

 

Any further idea ? 

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
May 11, 2021 May 11, 2021

Copy link to clipboard

Copied

LATEST

When you export as text Acrobat doesn't perform OCR on the document.

Export as Word or HTML has this option.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines