Highlighted

Utility to separate text from images?

New Here ,
Dec 12, 2019

Copy link to clipboard

Copied

We have a business process in which we import very large numbers of pdf files. Most of the files are all text, so the file sizes are small. Some of the files have embedded images, and the file sizes get much larger, quickly. A 30-50MB file is not uncommon.

 

Is there a utility or tool available that would could incorporate into the import process that would separate a pdf into, for example, two files with one containing the text and the other containing the images? I supposed we could also run it as a post-import process, by pointing it at a specified list of recently arrived files.

 

The keys are maintaining fidelity to the original text, and not needing human handling of each file.

 

Is there any tool or utility available that would do this?

 

Thanks, 

Steve

 

Topics

General, How to

Views

144

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more

Utility to separate text from images?

New Here ,
Dec 12, 2019

Copy link to clipboard

Copied

We have a business process in which we import very large numbers of pdf files. Most of the files are all text, so the file sizes are small. Some of the files have embedded images, and the file sizes get much larger, quickly. A 30-50MB file is not uncommon.

 

Is there a utility or tool available that would could incorporate into the import process that would separate a pdf into, for example, two files with one containing the text and the other containing the images? I supposed we could also run it as a post-import process, by pointing it at a specified list of recently arrived files.

 

The keys are maintaining fidelity to the original text, and not needing human handling of each file.

 

Is there any tool or utility available that would do this?

 

Thanks, 

Steve

 

Topics

General, How to

Views

145

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Dec 12, 2019 1
sangy LATEST
New Here ,
Feb 10, 2020

Copy link to clipboard

Copied

Hi,

Even I would like to know more about this kind of facility. Most of the time my business website involves more than 1000 words of text and images embedded. Other than there is a title on the images as well as the alt text. I need a tool that keeps everything intact and allows me to separate them in a click.  My other concern is the blog page. I need my webpage to easily recognize the PDF that I upload and fill in the smart form that we get in Shopify to add the content, images, links, and everything in place. 

 

Thanks,

SangyK

Pure-elegance

 

 

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Reply
Loading...
Feb 10, 2020 0