Participating Frequently
September 19, 2017
Answered

Convert PDF to XML

  • September 19, 2017
  • 1 reply
  • 5749 views

How do I convert a PDF document to XML using the Acrobat SDK?

Please provide any relevant links.

  1. Remove the header/footer from the PDF. The PDFs are not tagged, and the header/footer size is not fixed across documents.
  2. Remove the table of contents from the PDF.


1 reply

Legend
September 19, 2017

Which XML schema? Or, what manual selection are you trying to automate?

I do not see the connection between points 1 and 2 and the XML conversion, either, sorry.

Participating Frequently
September 19, 2017

No, there is no schema; the PDFs come in different formats. I want to convert PDF to text/XML while removing the header/footer and TOC.

Basically, whatever Acrobat Pro DC does with its export option (Export to Text/XML/HTML). We can achieve this in the Acrobat Pro DC editor, and we need to achieve the same through the API.

Thanks for your reply

lrosenth
Adobe Employee
Correct answer
September 19, 2017

There are APIs for doing this from a plugin, from JavaScript, and from IAC (Interapplication Communication), all of which are documented in the SDK along with sample code.
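
As a rough illustration of the IAC route mentioned above, here is a minimal sketch in Python. It assumes Windows, an installed Acrobat Pro, and the pywin32 package, none of which are stated in the thread. It opens the file through the `AcroExch.PDDoc` COM object, obtains the JavaScript bridge with `GetJSObject`, and calls the JavaScript `saveAs` method with a conversion ID. The helper names `conversion_id` and `export_pdf` are hypothetical, and the conversion IDs available may differ between Acrobat versions (the JavaScript property `app.fromPDFConverters` lists them), so treat the table as an assumption to verify against the SDK documentation.

```python
"""Sketch: export a PDF to XML or plain text through Acrobat's IAC (COM) interface.

Assumptions (not from the thread): Windows, Acrobat Pro installed, pywin32 available.
"""
from pathlib import Path

# Conversion IDs passed to the Acrobat JavaScript Doc.saveAs method.
# Assumption: these IDs exist in your Acrobat version; check app.fromPDFConverters.
CONVERSION_IDS = {
    "xml": "com.adobe.acrobat.xml-1-00",
    "text": "com.adobe.acrobat.plain-text",
}


def conversion_id(fmt: str) -> str:
    """Map a short format name (hypothetical helper) to an Acrobat conversion ID."""
    try:
        return CONVERSION_IDS[fmt.lower()]
    except KeyError:
        raise ValueError(f"unsupported format: {fmt!r}") from None


def export_pdf(src: str, dst: str, fmt: str = "xml") -> None:
    """Open src in Acrobat via COM and save a converted copy to dst."""
    conv_id = conversion_id(fmt)  # validate the format before starting Acrobat

    # Imported lazily so the mapping helper above works without pywin32.
    import win32com.client

    pd_doc = win32com.client.Dispatch("AcroExch.PDDoc")
    if not pd_doc.Open(str(Path(src).resolve())):
        raise OSError(f"Acrobat could not open {src}")
    try:
        js = pd_doc.GetJSObject()  # bridge to the JavaScript Doc object
        js.saveAs(str(Path(dst).resolve()), conv_id)
    finally:
        pd_doc.Close()
```

A call such as `export_pdf(r"C:\docs\in.pdf", r"C:\docs\out.xml")` (placeholder paths) would reproduce only the export step. Removing headers, footers, and the table of contents is not handled by this sketch and would need a separate step.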