Identify Non-OCRed Files in a Large Library and OCR Them
Hello,
I have around 5000 PDF files in various folders and subfolders; most of them are already OCRed, but some are not.
The problem is that when I run the OCR tool on my root folder, it also re-OCRs the files that are already OCRed, which consumes a lot of time and resources unnecessarily.
So my question is: how can I OCR only the files that have not been OCRed yet, without having to check each one manually?
Many thanks in advance!
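
To make the question concrete, here is a rough sketch in Python of the kind of pre-filter I have in mind. It is not tied to any particular OCR tool. It walks the folder tree and lists the PDFs that appear to have no text layer, so that only those get passed to OCR. The function names `looks_ocred` and `find_pdfs_needing_ocr` are made up for illustration, and the check itself is only a crude assumption: a PDF with a text layer normally references a font, so the raw bytes are searched for `/Font`. This can misjudge files whose font entries sit inside compressed object streams, and scans that carry a stamped text header, so a proper PDF library's text extraction would probably be a more reliable check.

```python
import os
from typing import Callable, Iterator


def looks_ocred(path: str) -> bool:
    """Crude, assumed heuristic: treat a PDF as OCRed if its raw bytes
    mention a /Font resource. Not a real text-layer check."""
    with open(path, "rb") as f:
        return b"/Font" in f.read()


def find_pdfs_needing_ocr(
    root: str,
    has_text: Callable[[str], bool] = looks_ocred,
) -> Iterator[str]:
    """Walk root recursively and yield PDFs for which has_text is False."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            # Match .pdf case-insensitively; skip every other file type.
            if not name.lower().endswith(".pdf"):
                continue
            path = os.path.join(dirpath, name)
            if not has_text(path):
                yield path
```

The resulting list could then be saved to a text file and fed to the OCR tool one file at a time. The `has_text` parameter is there so that the crude byte check can be swapped for a better test without touching the folder walk.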
