Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

bulk conversion of PDF files?

Community Beginner ,
Sep 09, 2018 Sep 09, 2018

I have 231,353 cast vote record PDF files. Short of manually opening each file in Acrobat and exporting it, is there a way to convert these files to text or CSV, either as individual files or as one large file? Or is there an API I can use to access the data from the PDF files? Would prefer using perl.

Thank you.

[Moved from the non-technical Lounge to an Acrobat forum... Mod]

[Here is the list of all Adobe forums... https://forums.adobe.com/welcome]

TOPICS
Edit and convert PDFs
2.9K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
1 ACCEPTED SOLUTION
Community Beginner ,
Sep 26, 2018 Sep 26, 2018
LATEST

Looks like

codeproject.com/Articles/7056/Code-to-extract-plain-text-from-a-PDF-file

will get me on my way. I suspect some assembly will be required.

View solution in original post

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Sep 09, 2018 Sep 09, 2018

That's way too much for Acrobat to be able to handle in a single process.

I don't know about Perl, but I've developed tools in Java that can process large amounts of files in this way.

If you're interested in purchasing such a tool feel free to contact me privately via try6767 at gmail.com .

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guide ,
Sep 10, 2018 Sep 10, 2018

You can try to setup a 'watch' folder. I did this many years ago for a similar need, but not nearly as many documents.

I'm not positive this will still work, but worth a search:

https://www.google.com/search?q=acrobat+watch+folder

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Sep 10, 2018 Sep 10, 2018

Acrobat doesn't support watched folders of any kind. Distiller does, though.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Sep 10, 2018 Sep 10, 2018

The Distiller documentation only mentions converting Postscript to PDF. Since the files I have are PDF, I don't see how Distiller would help...

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guide ,
Sep 12, 2018 Sep 12, 2018

Ah, your right, sorry. I must have been processing PS files. It was a long time ago...and didn't address your question about converting to a different format.

how about these thread?

How Do I Convert Multiple PDF to Word Documents?

batch convert pdf to txt using OCR with Adobe Acrobat Pro

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Sep 26, 2018 Sep 26, 2018
LATEST

Looks like

codeproject.com/Articles/7056/Code-to-extract-plain-text-from-a-PDF-file

will get me on my way. I suspect some assembly will be required.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines