Copy link to clipboard
Copied
Hi evryone,
I’ve been using the PDF to XML conversion in Acrobat pretty often, and it’s been working well so far. But now, I’m working on a more complex app and I’m looking for an online API that could handle this conversion in a more integrated way.
I stumbled upon this tool: Coolutils, though I haven’t had the chance to try it yet. I’m also wondering if Adobe might offer something similar through Creative Cloud.
While browsing, I found this link: Adobe Tools, but didn’t come across anything specific for XML conversion or an API that can be accessed from a third-party app.
I also came across this service from Adobe: PDF Services API, which I’ve subscribed to, but haven’t received any response yet.
I haven’t had much luck finding a solution so far. If anyone has suggestions—whether it's URLs, forums, or online resources—they’d be much appreciated!
thanks all... have a plesant day...
birnou
Copy link to clipboard
Copied
Before diving into a more in-depth study, I wanted to reach out to all of you to see if you have any feedback or thoughts on the topic.
As for the extraction API, I still haven’t heard back from Adobe, so I continued my search and here’s what I’ve found:
Zamzar offers 100 free tests and then a minimal subscription starting at 25 euros per month. However, it doesn't seem to support XML, only XLSX: https://developers.zamzar.com/docs
PDF.co seems to be more flexible with the number of tests and covers a wide range of languages like Node, PHP, and Python: https://developer.pdf.co/api/pdf-to-xml/index.html
DocParser has a limited trial period (14 days), so it’s best to be available and not have too many big tasks running in parallel when starting your first test. After that, it’s $32 per month. I couldn’t find much documentation on their REST API, and when PDF is mentioned, it seems more about PDF Forms, which might need further investigation: https://docparser.com/
iText PDF seems to be only focused on Java and .NET: https://itextpdf.com/resources/api-documentation. I would personally prefer something for Node or PHP.
PDFBox: Same situation, it's Java-only: https://pdfbox.apache.org/
Convert API looks very comprehensive, but doesn’t seem to handle PDF to XML conversion: https://www.convertapi.com/api/tag/pdf. I also couldn’t find any trial information, and the subscription starts at $30 per month.
If you have any insights or suggestions on any of these, or know of other options, I’d love to hear them! Does anyone have any information, or perhaps an URL I might have missed, regarding Adobe’s REST API for extraction?
Copy link to clipboard
Copied
Hi @B i r n o u ,
Thank you for reaching out. The Adobe PDF Services API provides modern cloud-based capabilities for PDF manipulation. The API is accessible through SDKs which help you get up and running quickly. To begin with you start reading for Export from here : https://developer.adobe.com/document-services/docs/overview/pdf-services-api/howtos/export-pdf/ .
Regards,
Divya Kumar Singh
Copy link to clipboard
Copied
Hello @divyasin , thanks for your feedback and sorry for the delay, I've had quite a bit of delay in email management.
During this time I was able to investigate and I was contacted by the beta service (automatic reply with registration link), unfortunately, the SDK seams apparently only JAVA oriented...
On the other hand, digging into the link you provide, it looks like a REST API might be what I'm looking for... good to investigate as export to XML doesn't seem to be supported, or documented
Can you, please, confirm that the beta program can cover this type of external use integrated with third-party applications (PHP or Node.js and Export to XML), or is it JAVA-only regarding API deployment?
If so, is it possible to speak to the development team before finalizing registration?
Thanks again for your attention and have a pleasant day
birnou