
Text Extraction from Native PDF

New Here, Mar 09, 2017

Hi,

The Adobe PDF version varies from customer to customer.

Hardware: a VM server with a good configuration

Requirement / Scenario (the PDF may be an application form or native text generated by the system):

  1. Tabular data without borders (borderless)
  2. Checkbox data in all its different forms
  3. Special characters
  4. Text inside images
  5. Free text (like a signature)
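
For scenario 1, a common approach is to rebuild the grid from text positions rather than from drawn lines. The following is only a minimal sketch of that idea, written in Python for brevity (the same logic ports to C#). It assumes the extraction library already returns each cell as one text chunk with a left x coordinate and a baseline y coordinate, and that columns are left-aligned. The `Chunk` tuple, the function names, and the tolerance values are hypothetical and are not part of iTextSharp or the Adobe PDF Library.

```python
from collections import namedtuple

# Hypothetical input shape: one entry per text chunk, with its left edge (x)
# and baseline (y) in PDF user space, where y grows upward.
Chunk = namedtuple("Chunk", "text x y")


def group_rows(chunks, y_tol=2.0):
    """Group chunks whose baselines lie within y_tol of a row's first chunk."""
    rows = []
    for c in sorted(chunks, key=lambda c: (-c.y, c.x)):  # top of page first
        if rows and abs(rows[-1][0].y - c.y) <= y_tol:
            rows[-1].append(c)
        else:
            rows.append([c])
    return [sorted(r, key=lambda c: c.x) for r in rows]


def column_starts(rows, x_tol=5.0):
    """Cluster left edges across all rows into column start positions."""
    starts = []
    for x in sorted(c.x for r in rows for c in r):
        if not starts or x - starts[-1] > x_tol:
            starts.append(x)
    return starts


def to_table(chunks, y_tol=2.0, x_tol=5.0):
    """Return a list of rows, each a list of cell strings (empty if missing)."""
    rows = group_rows(chunks, y_tol)
    starts = column_starts(rows, x_tol)
    table = []
    for row in rows:
        cells = [""] * len(starts)
        for c in row:
            # Assign the chunk to the nearest column start.
            idx = min(range(len(starts)), key=lambda i: abs(starts[i] - c.x))
            cells[idx] = (cells[idx] + " " + c.text).strip()
        table.append(cells)
    return table


if __name__ == "__main__":
    sample = [
        Chunk("Name", 50, 700), Chunk("Age", 200, 700.5),
        Chunk("Alice", 50, 680), Chunk("30", 201, 680),
        Chunk("Bob", 51, 660),
    ]
    for line in to_table(sample):
        print(line)
```

The tolerances would need tuning per document, and right-aligned or centred columns would need clustering on a different edge. Checkboxes, text inside images, and signatures are not covered by this sketch.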

Solution Tried:

We tried reading the data from the PDF in C# for the scenarios above, but did not get the expected output.

We used the iTextSharp library from C# to extract the content.

Any other library or suggestion would be really helpful.

TOPICS
Acrobat SDK and JavaScript
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
New Here, Aug 24, 2017

Hi Mohana,

I am also looking for a solution to the same kind of problem.

I have to read text from an image in a PDF document.

Which solution did you go ahead with? iTextSharp?

Is there a solution for this in the Adobe PDF Library? I have downloaded the sample SDK but cannot find any method that does this. There is a DocToImages method; what I want is something like an ImagesToDoc feature.

Explorer, Sep 14, 2017

We have a lot of experience extracting text using the Adobe PDF Library. Please contact me if you need additional help.

New Here, Sep 14, 2017

Can you provide your contact details? I am also looking to extract text from PDFs while removing headers/footers and images.

Explorer, Sep 15, 2017

Michael Peters

mpeters@mapsoft.com

New Here, Oct 10, 2017

Michael, I sent an email about this 4 weeks ago but haven't received a reply from you.

Explorer, Oct 11, 2017

Sorry - found it in my spam folder and have now replied.
