Skip to main content
Participant
January 4, 2023
Answered

Programmatic conversion of PDF to Word

  • January 4, 2023
  • 1 reply
  • 915 views

Hi All, I'm new to this community and am simply looking to see what's needed to support programmatic conversion of PDFs to Word as part of a 3rd party application.  Can somebody please point me in the right direction?  I'm assuming that I need an SDK, but which one, and if you have a sample call or doc reference that'd be ideal!  Thanks.

This topic has been closed for replies.
Correct answer Test Screen Name

The only interface I know of is the JavaScript doc.saveAs method. Doc and Doc.Media APIs — Acrobat-PDFL SDK: JavaScript Reference (adobe.com)

Notes

1. This could only work if the user is instructed to turn off Protected Mode in Acrobat. Some companies forbid this.

2. To run Acrobat JavaScript from an external app there is a special VB:JavaScript interface and a call to execute one line of JavaScript (simpler)

3. There are further security limitations on saveAs, as noted in the linked documentation. This might mean you can't use it.

4. Start with a basic test of the JavaScript command in the JavaScript console to see whether the result is worth aiming for.

5. Be aware that PDF -> Word is not a precise process. Word cannot handle the complexity of layouts found in PDF, and Word really wants to reflow to a completely new layout. Acrobat sometimes turns parts of the page into uneditable graphics to keep layout. Experiment carefully with your use case.

6. If you are planning to use this as a method to convert format X to Word, by going X -> PDF -> Word, I suggest you think again. Countless people have seen PDF as a simple way to get things converted, and much pain has resulted.

1 reply

Legend
January 4, 2023

Adobe have a number of options. The Acrobat SDK is a tool for automating Acrobat. Acrobat is a $20/month subscription which your end user must pay (you cannot bundle, share or host). If that suits your business model we could tell you more about the -rather clunky and limited - API. 

Participant
January 4, 2023

Thank you.  Yes, I'd like to hear more.  I'd like conversion from PDF to Word to be automated when the PDF has been uploaded to our product.

Test Screen NameCorrect answer
Legend
January 6, 2023

The only interface I know of is the JavaScript doc.saveAs method. Doc and Doc.Media APIs — Acrobat-PDFL SDK: JavaScript Reference (adobe.com)

Notes

1. This could only work if the user is instructed to turn off Protected Mode in Acrobat. Some companies forbid this.

2. To run Acrobat JavaScript from an external app there is a special VB:JavaScript interface and a call to execute one line of JavaScript (simpler)

3. There are further security limitations on saveAs, as noted in the linked documentation. This might mean you can't use it.

4. Start with a basic test of the JavaScript command in the JavaScript console to see whether the result is worth aiming for.

5. Be aware that PDF -> Word is not a precise process. Word cannot handle the complexity of layouts found in PDF, and Word really wants to reflow to a completely new layout. Acrobat sometimes turns parts of the page into uneditable graphics to keep layout. Experiment carefully with your use case.

6. If you are planning to use this as a method to convert format X to Word, by going X -> PDF -> Word, I suggest you think again. Countless people have seen PDF as a simple way to get things converted, and much pain has resulted.