Extract text from PDF with JavaScript
I have a slew of PDF documents from which I need to extract data. I am using 32-bit Adobe Acrobat Pro. I believe the files were originally scanned from Excel, but I have no way of knowing. I have tried to pull the data using various tools, including the most straightforward method, exporting to Excel. This works, but I am looking for a more elegant solution.

I would like to use JavaScript to iterate through the documents, which all have the same structure. My current stumbling block is that the documents contain undefined fields, mostly text fields, and I am not familiar enough with the Acrobat object model to iterate through them programmatically. How can I use the debugger to list each field? When I view the document as a form, the fields I need to identify have no Properties window, but I am assuming I can still manipulate them with a script.
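To make the question concrete, here is a minimal sketch of the kind of loop I have in mind. It assumes the Acrobat JavaScript Doc members `numFields`, `getNthFieldName`, `getField`, `getPageNumWords` and `getPageNthWord` behave as documented. The helper names `listFields` and `pageWords` are hypothetical, and the second helper is only a guess at a fallback in case the data turns out to be plain page text rather than real form fields.

```javascript
// Sketch intended for the Acrobat JavaScript console (assumption: opened
// with Ctrl+J). listFields and pageWords are hypothetical helper names;
// `doc` is an Acrobat Doc object (normally `this` in the console).

// Build one "name [type] = value" line per form field in the document.
function listFields(doc) {
  var lines = [];
  for (var i = 0; i < doc.numFields; i++) {
    var name = doc.getNthFieldName(i);
    var field = doc.getField(name);
    // getField returns null when no field with that name exists.
    if (field === null) continue;
    lines.push(name + " [" + field.type + "] = " + field.value);
  }
  return lines;
}

// Collect the words on one page (0-based index), for the case where the
// data is ordinary page text and not form fields.
function pageWords(doc, page) {
  var words = [];
  var count = doc.getPageNumWords(page);
  for (var i = 0; i < count; i++) {
    words.push(doc.getPageNthWord(page, i));
  }
  return words;
}

// In the Acrobat console, with one of the PDFs open:
//   console.println(listFields(this).join("\n"));
//   console.println(pageWords(this, 0).join(" "));
```

If `listFields` prints nothing, that would suggest the documents have no form fields at all. If the pages are pure scans, I would expect `pageWords` to return nothing either until OCR has been run.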
Thanks! (I can't upload a sample file, btw)
