Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

On exporting an interactive form to HTML the text-boxes checkboxes are not exported.

New Here ,
Oct 26, 2016 Oct 26, 2016

I want to convert an editable PDF form to HTML and for this I downloaded and installed the trial version of Adobe Acrobat Pro DC.

The PDF form is converted to HTML and it's format is intact but the HTML doesn't contain any input controls. The text-boxes, checkboxes, radio buttons and submit buttons are not present in the HTML, it just has the background image with borders for the input controls.

Is Acrobat Pro capable of exporting the editable PDF forms to HTML with proper input tags and buttons? What am I missing?

Any help is appreciated.

TOPICS
Acrobat SDK and JavaScript
635
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 26, 2016 Oct 26, 2016

This is not possible with Adobe Acrobat.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 27, 2016 Oct 27, 2016

Thanks Bernd for your quick response.

I've another question: Does Adobe Acrobat Pro has command line utility or API for converting PDF to HTML?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 27, 2016 Oct 27, 2016

For the API look at the Acrobat SDK.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 27, 2016 Oct 27, 2016

I checked the Acrobat DC SDK and unable to find any API for saving PDF to HTML.

Any pointer to the API method or which class to look into will be helpful.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 27, 2016 Oct 27, 2016

In the JavaScript reference the saveAs method.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 27, 2016 Oct 27, 2016

Thanks for the pointer Bernd... I'm able to save pdf as html using the JavaScript reference saveAs method.

For few sample PDFs it converts to HTML with proper formatting but for others the PDF formatting is lost in HTML, i.e., images and text are not properly aligned in HTML output.

You can view the sample PDF and its output at:

https://private.filesanywhere.com/Filesanywhere/fs/v.aspx?v=896b66885fa5b7ad72

Below is the VB.net code I'm using for conversion:

Dim srcDoc As Acrobat.CAcroPDDoc = CreateObject("AcroExch.PDDoc")

srcDoc.Open(sPDFPath)

Dim jsObj As Object = srcDoc.GetJSObject()

jsObj.saveAs(sHtmlPath, "com.adobe.acrobat.html")

Am I missing something? Is there any property or attribute of the JSObject to be set to get the perfect html output?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 27, 2016 Oct 27, 2016
LATEST

There is no such attribute or property.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines