Skip to main content
Participating Frequently
May 2, 2023
Question

Images Missing during extraction

  • May 2, 2023
  • 1 reply
  • 1566 views

Adobe Extract API fails to capture some of the images in pdf's when there are more than one page in pdf(Eg. Logo).

When I try the same pdf by splitting into single page pdf, all images are getting exctracted.

This is not just for one pdf, out of 100 cases with different type pdfs, 70% of multi page pdf's having similar issues.

I tried with Power Automate Adobe API & Python SDK. Both showing same results.

Anyone faced similar issues like this?

This topic has been closed for replies.

1 reply

Joel Geraci
Community Expert
Community Expert
May 2, 2023

Are these images near the top or bottom of the page? Are they repeated in the same place on multiple pages?

Participating Frequently
May 3, 2023

Most of them are same images repeated top and bottom of the pages(eg: Header logo). 

Joel Geraci
Community Expert
Community Expert
May 3, 2023

Ok - That explains it. At this time, Extract ignores what it considers to be a header and footer.  I've already submitted a feature request to include all page elements in a future release.