Participant
July 26, 2010
Question

Export PDF to XML


Hi,

I am trying to export a PDF to XML using Adobe Acrobat Professional.

The data exports pretty nicely, but the headers and footers from the PDF are not exported.

Is there a way to extract the headers and footers of a PDF document?

Thanks

AJ.


Participant
January 13, 2012


Another way to do that is to use EDI Link Connect. It can export data from a PDF, headers and footers included. The XML will be structured properly, so you can import it directly into the program of your choice. It's intended for business documents such as orders, invoices, shipping documents, and reports. I'm not sure if that's the type of PDF you're working with, but if it is, it might be worth a look. Here is a link with more info:

Converting PDFs to XML with EDI Link: http://ecdynamics.com/pdf-conversion.php

And another article:

http://softertech.wordpress.com/2011/12/12/importing-pdfs-into-quickbooks-or-simply-accounting/

lrosenth
Adobe Employee
July 27, 2010

PDF documents don't have "headers" or "footers" - it's all simply page content.

Also, you don't mention what method(s) you are using to export the XML and with what version of Acrobat and SDK.
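
To illustrate the point that a "header" or "footer" is just page content that happens to sit near the top or bottom of the page, here is a minimal Python sketch. It assumes you have already obtained text runs with their vertical positions from some PDF library; the `TextRun` type, the `split_by_band` function, and the 72-point margin are hypothetical names and values chosen for illustration, not part of Acrobat or any SDK.

```python
from dataclasses import dataclass


@dataclass
class TextRun:
    text: str
    y: float  # baseline position in points, measured from the bottom of the page


def split_by_band(runs, page_height=792.0, margin=72.0):
    """Group runs into header, body, and footer using only vertical position.

    page_height and margin are assumed values (US Letter, 1-inch bands);
    a real document may need different thresholds.
    """
    bands = {"header": [], "body": [], "footer": []}
    for run in runs:
        if run.y >= page_height - margin:
            bands["header"].append(run.text)
        elif run.y <= margin:
            bands["footer"].append(run.text)
        else:
            bands["body"].append(run.text)
    return bands


# Hypothetical runs, as they might come from a text-extraction step.
runs = [
    TextRun("Quarterly Report", 760.0),
    TextRun("Body paragraph", 700.0),
    TextRun("Page 1", 36.0),
]
print(split_by_band(runs))
```

Nothing in the file marks the first and last runs as special; the grouping comes entirely from the position threshold you pick.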

DAJINKYA
Author
Participant
July 27, 2010

lrosenth,

Apologies for the incomplete information. I am using Adobe Acrobat 9.0 Pro.

The way I am exporting to XML is "File->Export->XML" or "File->SaveAs->xml".

Our PDFs are produced with a free Java library: the source is a Word document that has a header and footer, and it is converted to PDF using that library.

So when I export that PDF to XML from Adobe Acrobat Pro, I don't see the header and footer values in the XML; everything else looks fine.

lrosenth
Adobe Employee
July 27, 2010

That area may be identified as an artifact, so it isn't getting exported. Without seeing a file, it's difficult to say.
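
One rough way to check whether that is the case is to look in the page's content stream for marked-content sequences tagged `/Artifact`. The Python sketch below is a simplified illustration, not a real PDF parser: it assumes the content stream has already been decompressed into a string, that marked-content sequences are not nested, and that text is shown with plain `(...) Tj` operators. The function name `artifact_text` and the sample stream are made up for the example.

```python
import re

# Matches a marked-content sequence tagged /Artifact, in either form:
#   /Artifact BMC ... EMC
#   /Artifact <<...>> BDC ... EMC   (or a named property list, /Name BDC)
# Simplification: stops at the first EMC, so nested sequences are not handled.
ARTIFACT_RE = re.compile(
    r"/Artifact\s*(?:(?:<<.*?>>|/\w+)\s*BDC|BMC)(.*?)EMC",
    re.DOTALL,
)

# Matches a literal string shown with the Tj operator, e.g. "(Page 1) Tj".
TJ_RE = re.compile(r"\(((?:[^()\\]|\\.)*)\)\s*Tj")


def artifact_text(content_stream: str) -> list:
    """Return the text strings drawn inside /Artifact marked-content sequences."""
    found = []
    for block in ARTIFACT_RE.finditer(content_stream):
        found.extend(TJ_RE.findall(block.group(1)))
    return found


# Made-up content stream: one header tagged as an artifact, one body paragraph.
sample = """
/Artifact <</Type /Pagination /Subtype /Header>> BDC
BT /F1 9 Tf 72 760 Td (Quarterly Report) Tj ET
EMC
/P <</MCID 0>> BDC
BT /F1 11 Tf 72 700 Td (Body paragraph) Tj ET
EMC
"""
print(artifact_text(sample))  # ['Quarterly Report']
```

If the header and footer text shows up in the result, it is tagged as an artifact in the file, which would be consistent with it being left out of the export.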