Extracting selective Text from pdf

Question

Is it possible to extract text from a pdf on the basis of any properties of that text, like its font?

I have a set of pdfs and an excel file.

The set of pdfs have different types of fields or questions(the general content of the pdfs is same but questions slightly vary for each pdf document ) and a more general list of all the questions and many more are in the excel file.

I wanna know which of these questions(in excel file) are present in each of the documents and make a matrix of that.

I could use the advanced search but due to the nature of the data i have to perform the search on the excel file using the pdf questions.

If anything is possible pls help

Thanks in Advance

try67 · Answer

I don't believe that's possible. A script might be able to extract texts of a specific font size, but not based on the font type itself.

It probably can be done using a stand-alone tool or maybe even a plugin, though.

Sign up

To post, reply, or follow discussions, please sign in with your Adobe ID.

Sign in to Adobe Community

To post, reply, or follow discussions, please sign in with your Adobe ID.

Scanning file for viruses.

This file cannot be downloaded