
Extract text from a group of regions to CSV

New Here, Jan 05, 2023


Hello, I need to create drawing indexes for a set that contains over 1000 drawings. I am able to do this with BlueBeam by creating Bookmarks. But some of our subcontractors don't use BlueBeam, so I am wondering whether Acrobat can extract text from a group of regions on each page to a CSV file.

 

The BlueBeam workflow is as follows:

Create Bookmarks that concatenate a series of selected regions from a page.

K27817525uslm_0-1672940699764.png

By drawing separate boxes around the Drawing No, Revision No and Drawing Title (placing a colon between each), I can assemble a bookmark that contains these 3 pieces of information.

K27817525uslm_1-1672940996987.png

The result is a list of bookmarks containing the text extracted from those same 3 regions for the entire set of drawings.

K27817525uslm_2-1672941228462.png

This list can then be exported to a CSV file.

K27817525uslm_4-1672941995757.png

Using the Excel import wizard, this info can be placed into a table for easy manipulation.

K27817525uslm_3-1672941557798.png

K27817525uslm_5-1672942246195.png

So, my hope is that Acrobat or Acrobat Pro can extract the text info from each PDF page and assemble it in Excel in a similar fashion.

 

TOPICS
Create PDFs , How to , Scan documents and OCR

Community Expert, Jan 06, 2023


Yes, this is possible, using a custom-made script.
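
A minimal sketch of what such a script might look like, assuming the drawings contain real (or OCR'd) text and that the title block sits in the same place on every page. It relies on the Acrobat Doc methods getPageNumWords, getPageNthWord and getPageNthWordQuads; the helper names, the region objects and their coordinates are hypothetical and only illustrate the idea. Words come back in the order Acrobat stores them, which may not match reading order.

```javascript
// Sketch only: `doc` stands for an Acrobat Doc object (`this` in an Action Wizard script).
// Region objects ({name, left, bottom, right, top}) are hypothetical, in page coordinates.

// Return the text of all words whose centre point lies inside `region`.
function extractRegionText(doc, pageNum, region) {
  var text = "";
  var wordCount = doc.getPageNumWords(pageNum);
  for (var i = 0; i < wordCount; i++) {
    var quads = doc.getPageNthWordQuads(pageNum, i);
    if (!quads || quads.length === 0) continue;
    var q = quads[0]; // eight numbers: four x,y corner pairs
    var cx = (Math.min(q[0], q[2], q[4], q[6]) + Math.max(q[0], q[2], q[4], q[6])) / 2;
    var cy = (Math.min(q[1], q[3], q[5], q[7]) + Math.max(q[1], q[3], q[5], q[7])) / 2;
    if (cx >= region.left && cx <= region.right &&
        cy >= region.bottom && cy <= region.top) {
      // bStrip = false keeps punctuation and trailing white space
      text += doc.getPageNthWord(pageNum, i, false);
    }
  }
  // Collapse runs of white space and trim both ends.
  return text.replace(/\s+/g, " ").replace(/^ | $/g, "");
}

// Quote a field if it contains a comma, a double quote or a line break.
function csvField(value) {
  var s = String(value);
  return /[",\r\n]/.test(s) ? '"' + s.replace(/"/g, '""') + '"' : s;
}

// Build one CSV line per page, with one column per region.
function buildIndexCsv(doc, regions) {
  var header = [];
  for (var r = 0; r < regions.length; r++) header.push(csvField(regions[r].name));
  var lines = [header.join(",")];
  for (var p = 0; p < doc.numPages; p++) {
    var row = [];
    for (var k = 0; k < regions.length; k++) {
      row.push(csvField(extractRegionText(doc, p, regions[k])));
    }
    lines.push(row.join(","));
  }
  return lines.join("\r\n");
}
```

Inside Acrobat, something like buildIndexCsv(this, regions) would return the CSV text, with the region coordinates measured once on a representative sheet. Rotated pages or title blocks that move between sheets would need extra handling that this sketch does not attempt.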

New Here, Jan 06, 2023


OK, I assume you are referring to JavaScript? If so, that is out of my depth at the moment.

Or could you elaborate, please?

Community Expert, Jan 06, 2023


Yes. This is not a trivial task, but I've developed similar scripts in the past. If you're interested in hiring a professional to create it for you, feel free to contact me privately by clicking my username and then "Send a Message".

Community Expert, Jan 06, 2023


Hi,

Try this script in the Action Wizard (it is attached in this file):

 

 

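// Header row for the CSV.
var theText = "Region 1;Region 2;Region 3";
// Walk the bookmark tree recursively and append each bookmark name as a new row.
function extractBookmark(bkm) {
	if (bkm.name != "Root") theText += "\r" + bkm.name;
	if (bkm.children != null) {
		for (var i = 0; i < bkm.children.length; i++) extractBookmark(bkm.children[i]);
	}
}
extractBookmark(this.bookmarkRoot);
// Attach the collected text to the PDF as theData.csv.
this.createDataObject("theData.csv", "");
var oFile = util.streamFromString(theText);
this.setDataObjectContents("theData.csv", oFile);
// Remove the bookmarks, then switch the navigation pane view
// (presumably to the Attachments panel).
this.bookmarkRoot.remove();
event.target.viewState = {overViewMode: 7};
// Save under a new name; the escaped dot makes sure only a real ".pdf" extension matches.
this.saveAs(this.path.replace(/\.pdf$/i, " & data .csv file attached.pdf"));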
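
The script above writes each bookmark name as one row verbatim, while the header row uses semicolons and the bookmarks described in the question use colons between the three values. A minimal sketch of one way to bridge that, assuming colon-separated bookmark names and a fixed number of columns; bookmarkNameToRow and quoteField are hypothetical helper names, not part of the Acrobat API:

```javascript
// Quote a field if it contains the separator, a double quote or a line break.
function quoteField(value, sep) {
  var s = String(value);
  var needsQuotes = s.indexOf(sep) !== -1 || /["\r\n]/.test(s);
  return needsQuotes ? '"' + s.replace(/"/g, '""') + '"' : s;
}

// Turn "A-101 : B : Floor Plan" into "A-101;B;Floor Plan".
// Colons beyond the expected column count stay inside the last field.
function bookmarkNameToRow(name, columns, sep) {
  var parts = String(name).split(":");
  var fields = parts.slice(0, columns - 1);
  fields.push(parts.slice(columns - 1).join(":"));
  while (fields.length < columns) fields.push(""); // pad short rows
  for (var i = 0; i < fields.length; i++) {
    var trimmed = fields[i].replace(/^\s+|\s+$/g, "");
    fields[i] = quoteField(trimmed, sep);
  }
  return fields.join(sep);
}
```

In the script, the row could then be appended with theText += "\r" + bookmarkNameToRow(bkm.name, 3, ";") instead of the raw bookmark name. Whether semicolon or comma is the right separator depends on the regional settings Excel uses on the machine that opens the file.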

 

 

Capture d’écran 2023-01-07 à 00.19.48.png
Capture d’écran 2023-01-07 à 00.20.09.png

See you later,
