Skip to main content
Participating Frequently
May 22, 2019
Answered

Large Scanning Project Question

  • May 22, 2019
  • 1 reply
  • 434 views

Hi, I have a client who has about 375,000 pages to scan. They just want to be able to search them by about 6 words. The words are pretty unique, like people's names & DOB etc., so they're not really "keywords". I'm wondering if they just put these words in one meta description field do you think that will work? Would all this go in a cache, that I believe has 100MB limit?

So to search they would say "metadata contains xx" and "metadata contains yy"... Do you think the search time would be acceptable?

Thanks!

    This topic has been closed for replies.
    Correct answer Lumigraphics

    Good luck with that. They will spend more on equipment than it will cost to contract the job out.

    And if they don't use OCR, how will they pull details from the documents?

    1 reply

    Legend
    May 23, 2019

    Is this going to be scanning to pdf and then running OCR on the scans?

    This is a LARGE project, they might want to look into farming it out to a document processing company. They would need a bulk scanner and lots of storage for that many pages.

    Participating Frequently
    May 23, 2019

    Yes it will scan to PDF but not OCR.

    They don't want to farm it out, they want to hire cheap labor to scan it in...

    LumigraphicsCorrect answer
    Legend
    May 23, 2019

    Good luck with that. They will spend more on equipment than it will cost to contract the job out.

    And if they don't use OCR, how will they pull details from the documents?