Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

How to extract tables without borders from a pdf using Acrobat Pro 2020

New Here ,
Oct 04, 2022 Oct 04, 2022

Hello, I am trying to extract a table of data from a pdf using Acrobat. This table is from a research article and doesn't have column or row borders. Unfortunately, when I try to extract the table to excel, it ends up extracting the data with rows merged or with a table row divided into two vertical cells, which is very time-consuming to fix. See below

 

table.pngtable excel.png

 

TOPICS
Edit and convert PDFs , How to
4.8K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 04, 2022 Oct 04, 2022

You won't get any better results. Acrobat hat no clue, what a PDF visually represents to you. It can interpret vartical spaces as columns and line breaks as rows, but it can't determine if a row should contain multiple lines of text or not.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 04, 2022 Oct 04, 2022

But it looks like it's having trouble even interpreting where the columns and rows are at.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 04, 2022 Oct 04, 2022

Do you have access to the document that produced the original pdf? Using the original might produce better results, you can often determine the software that created a pdf by going to File> Properties> Description. If you don't need to use the chart for anything other than a visual reference, you may be able to add the pdf to your Excel file as an image (I don't currently use Excel, so I'm guessing here). You could also try to export to spreadsheet, then copy and paste, unless you have already tried this.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 04, 2022 Oct 04, 2022

No, unfortunately, I don't have access to the original file. My goal is to be able to extract this data and create another table by adding this data to other data and analyzing them all in aggregate.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 04, 2022 Oct 04, 2022

So you did try exporting as a spreadsheet? Try exporting as Word, then copy/paste into Excel, also try saving as rich text, with a bit of luck, one method might show some improvement.

In the past, if I needed to recreate a table (in InDesign), I would place the table as an image, reduce the opacity to 50%, then re-set all new copy on top of the original in a different color, then delete the image when done, the advantage was it's easy to catch any errors.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 04, 2022 Oct 04, 2022

Yes, the screenshot above is the result of that export. Word and rtf also have the same problem. Is there a way to manually set where the column and row boundaries are?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 04, 2022 Oct 04, 2022

No, not that i'm aware.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Oct 04, 2022 Oct 04, 2022
LATEST

As mentioned, you can't change the output Acrobat creates. However, it might be possible to get better results using a custom-made script, especially if the columns are always the same size. It's not a simple task, though. If you're interested in hiring a professional to create such a script for you, feel free to contact me privately (click my username and then on Send a Message to do so).

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines