• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
0

Extract Text from pdf using C#

Guest
Mar 29, 2012 Mar 29, 2012

Copy link to clipboard

Copied

Hi,

We are Solution developer using Acrobat,as we have reuirement of extracting text from pdf using C# we have downloaded adobe sdk and installed. We have found only four exmaples in C# and those are used only for viewing pdf in windows application. Can you please guide us how to extract text from pdf using SDK in C#.

Thanks you for your help.

Regards

kiranmai

TOPICS
Acrobat SDK and JavaScript

Views

36.7K

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Explorer , Apr 04, 2012 Apr 04, 2012

Try page 135 of this document

http://wwwimages.adobe.com/www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/plugin_apps_developer_guide.pdf

More than likely, you'll need to write a plugin to handle this because IAC doesn't seem to support it through COM

If you are creative in using the JS interface, you can extract all the words from a document. You would need to use a loop and put everything into an array or a List.
Take a look at page 311 of

http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/js_api_reference.pdf

Votes

Translate

Translate
Guest
Mar 14, 2013 Mar 14, 2013

Copy link to clipboard

Copied

no i dont have any idea how to communicate plugin with my application, based on your suggestion to use plugin to get word properties i just started to create plugin, i am trying to build all the sample plugins provided by sdk but non of them are working. basically my requirement is to find different properties of words extracted from pdf like font size, font weight and style etc.. . based on my research i came to know that by using pdfwordfinder we can extract words from pdf, but i didnot find that plugin so i am trying to create that plugin with the help of document but i am not able to do. can you please suggest whether we already have pdfwordfinder plugin or i have to create

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Mar 14, 2013 Mar 14, 2013

Copy link to clipboard

Copied

PDWordFinder is not a plug-in. It is a collection of methods you can use in your own plug-in.

A plug-in installs a series of routines to Acrobat which are called on certain events.

For example, a plug-in can add a menu item to Acrobat. When the user clicks your menu item, your routine is called. Then it can do the required task and report results. (Usually by popping up a message on screen if required, but it is not limited to this.)  Are you happy to have your function called from a menu by the user of Acrobat?

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guest
Mar 14, 2013 Mar 14, 2013

Copy link to clipboard

Copied

ok, can we use plugins in c#, because i am developing application in c#. i will try to create plugin but i am not able to solve the error that i am getting when building sample apis. if i am not able to solve that error then i cannot work on exsisting sample api nor i cannor create new api, please suggest why i am getting that error and how to solve it.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Mar 14, 2013 Mar 14, 2013

Copy link to clipboard

Copied

LATEST

"i am developing application in c#."

I recommend moving the whole application into a plug-in, where it will be convenient for the end user of Acrobat to run your PDF functionality in one place.

"please suggest why i am getting that error and how to solve it."

I have done so. Twice. Is my advice confusing, or does it seem not to apply to your problem?

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines