Copy link to clipboard
Copied
Hi All, I'm new to this community and am simply looking to see what's needed to support programmatic conversion of PDFs to Word as part of a 3rd party application. Can somebody please point me in the right direction? I'm assuming that I need an SDK, but which one, and if you have a sample call or doc reference that'd be ideal! Thanks.
The only interface I know of is the JavaScript doc.saveAs method. Doc and Doc.Media APIs — Acrobat-PDFL SDK: JavaScript Reference (adobe.com)
Notes
1. This could only work if the user is instructed to turn off Protected Mode in Acrobat. Some companies forbid this.
2. To run Acrobat JavaScript from an external app there is a special VB:JavaScript interface and a call to execute one line of JavaScript (simpler)
3. There are further security limitations on saveAs, as noted in the linked documentation.
...Copy link to clipboard
Copied
Adobe have a number of options. The Acrobat SDK is a tool for automating Acrobat. Acrobat is a $20/month subscription which your end user must pay (you cannot bundle, share or host). If that suits your business model we could tell you more about the -rather clunky and limited - API.
Copy link to clipboard
Copied
Thank you. Yes, I'd like to hear more. I'd like conversion from PDF to Word to be automated when the PDF has been uploaded to our product.
Copy link to clipboard
Copied
The only interface I know of is the JavaScript doc.saveAs method. Doc and Doc.Media APIs — Acrobat-PDFL SDK: JavaScript Reference (adobe.com)
Notes
1. This could only work if the user is instructed to turn off Protected Mode in Acrobat. Some companies forbid this.
2. To run Acrobat JavaScript from an external app there is a special VB:JavaScript interface and a call to execute one line of JavaScript (simpler)
3. There are further security limitations on saveAs, as noted in the linked documentation. This might mean you can't use it.
4. Start with a basic test of the JavaScript command in the JavaScript console to see whether the result is worth aiming for.
5. Be aware that PDF -> Word is not a precise process. Word cannot handle the complexity of layouts found in PDF, and Word really wants to reflow to a completely new layout. Acrobat sometimes turns parts of the page into uneditable graphics to keep layout. Experiment carefully with your use case.
6. If you are planning to use this as a method to convert format X to Word, by going X -> PDF -> Word, I suggest you think again. Countless people have seen PDF as a simple way to get things converted, and much pain has resulted.
Copy link to clipboard
Copied
Thank you for steering me to this information. Will explore from here, mostly regarding #5 as it's important that we get the same look-n-feel. As for #6, we would typically be going from PDF (PDFa) to DOCX.
Thanks again. You've been extremely helpful.