API integration with ROR and extracting column based content from pdf

Community Beginner ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

Hi There,

 

I would like to read a pdf and extract content based on search strings provided (via code).

This content can be in column formats as well. Hence the service should be able to identify and return content accordingly.

I am using Ruby on Rails. Hence would like to know if these API's can be incorporated in Ruby on Rails.

 

Regards,

Mru

 

TOPICS
PDF Extract API , PDF Services API

Views

46

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

Just so you know: PDF files don't contain columns. Only text, which might appear on screen as columns. There are no markers in the file to say "here is column 1" etc.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

Thanks for your reply.

Yes, I am referring to the text being present in columns.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

But I am telling you the text is not present in columns. It only looks that way.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

Okay.

Request you to suggest which APIs I should be using and how I could integrate it with RoR.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Aug 10, 2021 Aug 10, 2021

Copy link to clipboard

Copied

LATEST

I am not an expert on the APIs, but an expert on the internals of PDF.  

I am pointing out that wanting to extract in columns may not be possible. If the APIs do not offer this option, do not be surprised.

Likes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Resources