We creating a process for reading some pdf files and convert them to some type of xml,
we saw that at least regarding tables, Acrobat has good output.
Our question is if it possible to use the .net api for exporting pdf to xml as it does in the acrobat pro dc software?
Thank for any lead,
Would this be used on a Mac or Windows computer where the user already has Acrobat Pro installed, with an active subscription of their own (Making paying Adobe for an Acrobat Pro Subscription a condition of using your process)? Or is it something else such as a web service?
Why does it matter?
We can start by using it on specific computer with license, in that case does it exist?
It matters because it CANNOT be run on a web service. No license exists for this. So, it would waste your time if I helped you use the Acrobat SDK to make a prototype it was impossible to run in the way you plan.
Thank you, i understand.
The license issue is also limited the option for exporting the pdf as docx? (it's unallowed to run it in a bulk, or inside a web serivce?)
The license issue is for running Acrobat. The Acrobat SDK is just info on how to talk to Acrobat. So, yes, everything you might automate with the SDK is affected by the license.
If you want a web based solution, you might look at Adobe offerings not based on Acrobat, such as here: https://www.adobe.io/apis/documentcloud/dcsdk/
NB they may not use common code or produce identical results. Test carefully, since extraction/conversion is a very uncertain area.
Also, even on a regular computer it is only licensed for single user, interactive use. It cannot be used, under the terms of the license, for a batch or bulk work triggered from the outside. Acrobat may not be the tool you need... hence the questions.