(Combine and OCR) or (OCR and Combine), which is better?
I scanned old internal reports to hundreds of PNGs. I need to turn each batch of PNGs into a single OCRed PDF using Acrobat. I am wondering which of the following two options is better:
- "Combine and OCR" -- combine PNGs into a PDF and then perform OCR (Editable Text and Images)
- "OCR and Combine" -- Perform OCR on multiple files (Editable Text and Images) and then combine the resulting PDFs into one PDF.
Preliminary testing (with 10 pages) seems to show that option 1 (Combine and OCR) will yield slightly smaller PDFs, but I am not sure why or if I can expect this on large scale as well.

I assume that if I combine the PNGs to a single PDF and I OCR this PDF, the OCR engine can perhaps better optimize the fonts, the document overhead, etc. than when combining already OCRed PDFs into one larger PDF.
Anyone with more experience can provide advice or thoughts?

