Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Text Recognition on oversized documents or Photoshop PDF conversion with near original file sizes needed

Guest
Aug 15, 2016 Aug 15, 2016

   I am trying to use text recognition on oversized documents to recognize text so that I can hi-lite it. The documents are about 7.5 inches by 19 inches. I am using Acrobat 10 to do so and it appears not capable of recognizing text above 8.5 by 11 page sizes. Is there any program or any way of having text recognized above 8.5 by 11 page sizes?

   I have opened the large acrobat document in Photoshop cs5 and changed the size of the page to 8.5 by 11 and then saved it as a photoshop pdf keeping the original resolution at 300. I need 300 because otherwise I can’t read the page onscreen when I enlarge it. I also have changed the settings to no layers and jpeg minimum. The original document file size is 358kb and the Photoshoped converted ends up at 2.21 mbytes. That is the best I could do because if I ran it at maximum jpg it would come out to 6mbytes. Is there any through Photoshop to keep file size at the original 358kb. The extra file size wouldn’t normally bother me for one file but I have thousands I have to convert.

      So either I need a way to do text recognition on larger sized pages or I need to be able to convert documents with Photoshop and keep near the same file sizes.

TOPICS
Acrobat SDK and JavaScript
1.0K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Aug 16, 2016 Aug 16, 2016

Hi John,

Sorry for the issue you are facing. This should not happen. Can you please share the following information to help us identify and resolve the issue ASAP:

- OS Detail

- Acrobat Version

- Are you getting any error message

- 1 Sample file of this large page size (you can use https://cloud.acrobat.com/send  to share the file)

Thanks,

Lovekesh Garg

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guest
Aug 16, 2016 Aug 16, 2016

    These are newspaper-size pages that have already been scanned but not with text recognition. They come in full page sizes and in clip sizes. Most of the time I can use Acrobat's text recognition on the clips depending on the quality but on larger full page sizes forget it because I have tried more than a dozen times on different dates. It DOESN't WORK though the clips of the same page do work and for some reason the clips are usually of slightly less quality than the full pages.

    The smaller clips are all fine if the clips are small but sometimes the clips span many columns or even a full page and so text recognition won't work. I am using Acrobat 11 and Photoshop CS5.

    I'll upload a sample sometime later.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Oct 21, 2016 Oct 21, 2016
LATEST

Hi John,

Are you still facing this issue. If yes, please share a sample document(You can even share it personally if you have any security reasons). You can also use https://cloud.acrobat.com/send

Also please try latest Acrobat DC where size limit was handled in a quite better way.

Download Adobe Acrobat free trial | Acrobat Pro DC

Thanks.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Aug 16, 2016 Aug 16, 2016

Some thoughts

1. You are using obsolete and unsupported versions of both Acrobat and Photoshop. Things could have improved.

2. I think you are going to run OCR on this Photoshop-saved PDF? If so the file size is irrelevant. What matters is quality for OCR. It is generally accepted that JPEG ruins OCR. So save as ZIP and don't count megabytes.

3. Focus closely on the OCR settings, they will dictate the file size.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines