Copy link to clipboard
Copied
I have seen tutorials on extracting text content from pdf using pdfservices-sdk. I would like to understand how to extract images in the pdf using the sdk(python).
I have tried extracting the images using pdfextract api. i would like to do the same using pdfservices-sdk
Copy link to clipboard
Copied
I'm not sure I understand your question. PDF Extract is one of our APIs, and the PDF Services SDKs are simply wrappers to use our APIs, all of them. You install one SDK (if you want, we have nice REST APIs) and get access to use any API.
And yes, Extract can extract the images, we've got docs showing you how to do that. Have you tried it yet?
Copy link to clipboard
Copied
Hi Raymond,
Thank you for the quick response. I would like to highlight my exact requirement.
Lets consider I have a pdf which contains images along with texts. I am able to extract the texts from it using pdfservices-sdk by following the tutorial. Additionally, the other requirement is to download the images present in the same pdf. And I would like to use pdfservices-sdk for it as well.
Unfortunately I could not find a helpful documentation regarding the same. If you could provide any leads/links to the documentation would be really appreciated
Thanks in advance
Copy link to clipboard
Copied
You want this, https://developer.adobe.com/document-services/docs/overview/pdf-extract-api/howtos/extract-api/#extr... and click on the Python tab.
Copy link to clipboard
Copied
I didn't see exemples for Figures. Only for text and table. Is it normal ?
Copy link to clipboard
Copied
There may not be an example for every way of using the API, but have you read the docs?