How to? Automatic extraction and naming of all single pages of a PDF-file
Hello everybody!
I'm elaborating 8 millions of hand made index cards (paper, size A6 approximately) for a dictionary, that have been scanned as pictures, saved as PDF (400 dpi, color) and gathered in documents of nearly 100 pieces each. I have therefore nearly 75.000 single documents. The next step is the digitalization of the content of each card, passing their standard informations in a kind of a relational database. I want do automatize this process as much as possible. (If I accelerate the process of elaborating a single card of 1 sec., it means I have saved 7,5 millions of seconds = 270 working days of 8 hours! (If you want to see an example, please feel free to email me at themistokliu-at-hotmail-dot-de.)
The 100 digital cards of a document have the same "title" or belongs to the same word.
My question:
Is there any method to automatically split the 100-page-PDF-document in 100 single PDF-documents, which are automatically named by the title? For example, if a document named HOWTO.pdf has three pages, and these have respectively for example the titles PAGEONE (written with the function "TEXT" on the top of the page), PAGETWO and PAGE THREE. Is it possible to automatically generate three single PDF-documents by extracting them, which are automatically and respectively named by the titles of the single pages, i.e. PAGEONE.pdf, PAGETWO.pdf and PAGETHREE.pdf?
Thank You very much in advance for helping me to save a lot of time...
Jack
