Read pdf programatically
Is there a way to read pdf pro-grammatically?
reading of text would be helpful to start with. in java preferably.
Is there a way to read pdf pro-grammatically?
reading of text would be helpful to start with. in java preferably.
Ok, the server issue is extremely important. Adobe's main programmatic interfaces are offered through the Acrobat SDK. This isn't really a product, just a name for documentation of the interfaces of Acrobat (not Reader). (You've posted in the Acrobat Reader forum, but I assume you know the difference between Acrobat and Acrobat Reader).
Your key issue would be that Acrobat is not for server use. Not licensed and (though it is irrelevant) not technically suitable. So there's no point looking at it if the final deployment is a server. Instead Adobe offer the Adobe PDF Library, which can be licensed (on a royalty basis, price by negotiation) for server use. It has a C/C++ interface which is in fact similar to part of the Acrobat SDK.
But there is nothing to do with text extraction from tables, because there is no such thing in a PDF. Are you familiar with the PDF specification, ISO 32000-1? There's nothing there but text and lines, with no connection between them.
Already have an account? Login
Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.