Text Extraction from Native PDF
Hi,
Adobe PDF version may keep varying depends on the customer..
Hardware - VM Server with good configuration
Requirement / Scenario (PDF may be application form based or native text which is generated by the system)
- Tabular data without border (border less)
- Checkbox data in all different forms
- special characters
- Text inside the Image
- Free text (like signature)
Solution Tried:
We are using C# code through which we have tried to read the data from the PDF for the above scenario and didn't get the expected output.
we used ITextSharper library and used the same from C# to extract the content.
Any other library or suggestion would be really helpful.
