Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Converting PDF to Excel and Issue with Structure and Character Artifacts

New Here ,
May 16, 2016 May 16, 2016

I am trying to convert a standard report from a tax software application to Excel.  Screenshots of a standard snippet from the PDF report and the Excel conversion result are below.  I think the starting point is finding some way to systematically remove the box graphics; once those are removed, a lot of the structure issues are easier to address.  I think.

But I am desperate.  The software vendor has confirmed they have no way to get these reports in Excel rather than PDF, not that they're willing to invest in, at least.  But figuring this out would save a lot of time.

PDF with Structure and Formatting:

PDF.png

Excel Conversion Result:

Excel.png

TOPICS
Acrobat SDK and JavaScript
412
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
May 17, 2016 May 17, 2016

One update, I was able to figure out that the root cause is the wingdings font.  If I can select and delete all the instances of text with wingdings font I think that will go a long way towards clearing this up.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
May 17, 2016 May 17, 2016
LATEST

Hi Nathan,

Please share the pdf file and the excel file @ agarwala@adobe.com

Thanks and Regards,

Girija

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines