Copy link to clipboard
Copied
I am trying to use text recognition on oversized documents to recognize text so that I can hi-lite it. The documents are about 7.5 inches by 19 inches. I am using Acrobat 10 to do so and it appears not capable of recognizing text above 8.5 by 11 page sizes. Is there any program or any way of having text recognized above 8.5 by 11 page sizes?
I have opened the large acrobat document in Photoshop cs5 and changed the size of the page to 8.5 by 11 and then saved it as a photoshop pdf keeping the original resolution at 300. I need 300 because otherwise I can’t read the page onscreen when I enlarge it. I also have changed the settings to no layers and jpeg minimum. The original document file size is 358kb and the Photoshoped converted ends up at 2.21 mbytes. That is the best I could do because if I ran it at maximum jpg it would come out to 6mbytes. Is there any through Photoshop to keep file size at the original 358kb. The extra file size wouldn’t normally bother me for one file but I have thousands I have to convert.
So either I need a way to do text recognition on larger sized pages or I need to be able to convert documents with Photoshop and keep near the same file sizes.
Copy link to clipboard
Copied
Hi John,
Sorry for the issue you are facing. This should not happen. Can you please share the following information to help us identify and resolve the issue ASAP:
- OS Detail
- Acrobat Version
- Are you getting any error message
- 1 Sample file of this large page size (you can use https://cloud.acrobat.com/send to share the file)
Thanks,
Lovekesh Garg
Copy link to clipboard
Copied
These are newspaper-size pages that have already been scanned but not with text recognition. They come in full page sizes and in clip sizes. Most of the time I can use Acrobat's text recognition on the clips depending on the quality but on larger full page sizes forget it because I have tried more than a dozen times on different dates. It DOESN't WORK though the clips of the same page do work and for some reason the clips are usually of slightly less quality than the full pages.
The smaller clips are all fine if the clips are small but sometimes the clips span many columns or even a full page and so text recognition won't work. I am using Acrobat 11 and Photoshop CS5.
I'll upload a sample sometime later.
Copy link to clipboard
Copied
Hi John,
Are you still facing this issue. If yes, please share a sample document(You can even share it personally if you have any security reasons). You can also use https://cloud.acrobat.com/send
Also please try latest Acrobat DC where size limit was handled in a quite better way.
Download Adobe Acrobat free trial | Acrobat Pro DC
Thanks.
Copy link to clipboard
Copied
Some thoughts
1. You are using obsolete and unsupported versions of both Acrobat and Photoshop. Things could have improved.
2. I think you are going to run OCR on this Photoshop-saved PDF? If so the file size is irrelevant. What matters is quality for OCR. It is generally accepted that JPEG ruins OCR. So save as ZIP and don't count megabytes.
3. Focus closely on the OCR settings, they will dictate the file size.
Find more inspiration, events, and resources on the new Adobe Community
Explore Now