Copy link to clipboard
Copied
I am using the extract API for PDF tables to export to an Excel file (pdfservices-node-sdk 3.2.0). In the same PDF file, there are 4 identical tables, but when exporting, only 2 tables are extracted. What factors could be affecting the export of tables? How can I adjust to achieve the best results?
Only the table highlighted in red is successfully exported to Excel.
Copy link to clipboard
Copied
Copy link to clipboard
Copied
It's an AI. The code to do the page segmentation can be off when deciding if something is a figure vs a table vs. text. Honestly, given the proximity to the drawings above the tables, I'd have thought that the bottom two tables would get read and not the top 2.
Unfortunately, there are no "knobs" to turn to get better results but with your permission, I can send your file to engineering to train for this sort of thing.
Copy link to clipboard
Copied
Thanks you so much
Copy link to clipboard
Copied
Sure. Thanks you so much.
Copy link to clipboard
Copied
"There's a bit of confusion. More accurately, figures 2 and 4 have Excel table results, while figures 1 and 3 cannot generate an Excel file."
Copy link to clipboard
Copied
can you please share the code of extracting table and savign them to a excel file?
Copy link to clipboard
Copied
It's in the samples.
Get ready! An upgraded Adobe Community experience is coming in January.
Learn more