• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers

BAD_PDF Error in Extract API

New Here ,
Sep 11, 2022 Sep 11, 2022

Copy link to clipboard

Copied

See the attached PDF.  The Extract API is throwing an error – “BAD_PDF - Unable to extract content” with no additional information.  What is wrong with the PDF? How can I program the API to fix the error or skip pages that show an error and keep processing?  Thanks. 

TOPICS
PDF Extract API

Views

72

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 26, 2022 Oct 26, 2022

Copy link to clipboard

Copied

LATEST

A similar issue was faced by me. After viewing the attached pdf, I observed that you have different page widths of those containing tables. In such case, the API is most likely to fail. A work around could be preprocessing the pdf before passing to API and making sure that a given sub-part of the original pdf has all the pages of same size. This can resolve the issue.  

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Resources