• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
1

Adobe Extract API: Some text is extracted as images.

Guest
Feb 18, 2022 Feb 18, 2022

Copy link to clipboard

Copied

I am developing a system that uses the Adobe Extract API to extract sentences from PDF file in JSON format.

When I hit the Adobe Extract API against the PDF file, parts of the document where was the text was extracted as image.


In the image below, red part is extracted as text (expect result).
But blue part was extracted as image (unexpected result).

スクリーンショット (52).png

 

 


This is the code I used:
https://github.com/adobe/pdftools-extract-node-sdk-samples/blob/main/src/extractpdf/extract-text-tab...

 

This is the PDF file. Page of the image above is on page 59:

JP-N-KP-EPI-2000063_イーケプラ経口剤IF_rev18.pdf (ucbjapan.com)

 

The language of the PDF is Japanese. The PDF file is not a scanned paper file, but a PDF file of digitally created data.

Is there any information or solution for this?

TOPICS
PDF Extract API

Views

395

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Explorer ,
Feb 22, 2022 Feb 22, 2022

Copy link to clipboard

Copied

@Deleted User Did you find any solution to this outside the Adobe Community?

Had encountered a similar problem, but there's no resolution to it.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guest
Feb 28, 2022 Feb 28, 2022

Copy link to clipboard

Copied

LATEST

@Nikhil Ranka 

Unfortunately, we cannot find any solution yet.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Resources