Question
Parsing PDF Files
Hi everyone. So I have a bunch of pdf files that contain text
data which I need to retrieve. Is there any way to parse it? The
data is stored with the first line has the column names followed
delimited commas. Every line after is one row of data.
ex:
id,fname,lname,age
532,Tom,Stevens,33
42,John,Baldwin,38
...
I've tried using the cfpdf tag, but as far as I know that tag is mainly used for pdf creation, not extraction.
It does have a read function: <cfpdf action="read" source="data.pdf" name="mypdf"> but I don't know how to use the 'mypdf' variable after I read the file. If I dump it out, it just shows details about the pdf file, but not what it contains.
Thanks for any help you can provide.
ex:
id,fname,lname,age
532,Tom,Stevens,33
42,John,Baldwin,38
...
I've tried using the cfpdf tag, but as far as I know that tag is mainly used for pdf creation, not extraction.
It does have a read function: <cfpdf action="read" source="data.pdf" name="mypdf"> but I don't know how to use the 'mypdf' variable after I read the file. If I dump it out, it just shows details about the pdf file, but not what it contains.
Thanks for any help you can provide.