Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

I have a pc, windows 7, adobe acrobat X V. 10: some of scanned pages are not being OCR? why? Is there something wrong with Acrobat OCR?

Community Beginner ,
Jul 15, 2016 Jul 15, 2016

I have a pc, windows 7, adobe acrobat X V. 10: some of scanned pages are not being OCR? why? Is there something wrong with Acrobat OCR?

TOPICS
Acrobat SDK and JavaScript , Windows
1.2K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines

correct answers 1 Correct answer

Adobe Employee , Jul 21, 2016 Jul 21, 2016

I checked the file you shared, it is partially OCRed. That's why you are getting the issue. We are trying to resolve it in latest Acrobat DC.

For now you can use the workaround available for this issue

1. Go to File> Save As Other> Image> (any Format)

2. Create new PDF from these images.. Create > PDF from File> select all these images

3. Run OCR now. It will work and OCR it completely

It may be an overhead but will resolve your issue as of now. Please feel free to ask anything you want.

Thanks.

Translate
Adobe Employee ,
Jul 19, 2016 Jul 19, 2016

Hi Alton,

Acrobat X goes end of life last year. So there is no change in Acrobat in past few months.

Please try trial version of Acrobat DC to run OCR on the same file. Acrobat DC has much better output for OCR than Acrobat X.

Download Adobe Acrobat free trial | Acrobat Pro DC

Also please share

- the error you are getting and

- a sample PDF file using https://cloud.acrobat.com/send where you are facing this issue.

Thanks,

Lovekesh Garg

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 19, 2016 Jul 19, 2016

Hi Lovekesh,

Here is an example where pg 53 is OCR and the other isn’t.

https://files.acrobat.com/a/preview/bded1332-ecc9-46d2-a04e-c65c05a84cfe

Screenshot shown below. Your saying Acrobat DC will fix this issue.

Thanks,

Alton Chung | Senior Regulatory Affairs Associate|

Amneal Pharmaceuticals |

41 Colonial Drive, Piscataway NJ 08854 |

732.645.3030 | ext. 6211 |

altonc@amneal.com

P

IMPORTANT NOTICE: The information contained in this electronic e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged. If any reader of this communication is not the intended recipient, unauthorized use, disclosure or copying is strictly prohibited, and may be unlawful. If you have received this communication in error, please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 19, 2016 Jul 19, 2016

Hi Lovekesh,

Here’s a better example, disregard the previous example. The first page has chromatogram section in middle not highlighted, or OCR. The 2nd page has been OCR.

https://files.acrobat.com/a/preview/59b73c24-224c-4668-83a9-c73e2d67082d

Thanks,

Alton Chung | Senior Regulatory Affairs Associate|

Amneal Pharmaceuticals |

41 Colonial Drive, Piscataway NJ 08854 |

732.645.3030 | ext. 6211 |

altonc@amneal.com

P

IMPORTANT NOTICE: The information contained in this electronic e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged. If any reader of this communication is not the intended recipient, unauthorized use, disclosure or copying is strictly prohibited, and may be unlawful. If you have received this communication in error, please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Jul 20, 2016 Jul 20, 2016

Sorry for the issue you are facing.

Both the pages seems quite similar. It's not expected that 1 page is OCRed and other not. I tried to copy the image(chromatogram section which was not OCred) and run OCR on it in Acrobat DC and it worked fine.

Can you please provide following information:

- Both the pages OCRed with same Acrobat and same OCR settings

- Acrobat version you used to OCR

- Did you try Acrobat DC for this file

- Please provide original file before OCR as well to help us identify what's causing this issue.

Thanks.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 20, 2016 Jul 20, 2016

Hi Lovekesh,

The Acrobat version used is X. (Patch update 10.0.0)

I didn’t try Acrobat DC, I will try.

Attached is ocr-1 (this is ocr- you’ll notice page 3-10 chromatogram did not ocr)

Attached is not-ocr-1(this is original and not ocr.)

Thanks,

Alton Chung | Senior Regulatory Affairs Associate|

Amneal Pharmaceuticals |

41 Colonial Drive, Piscataway NJ 08854 |

732.645.3030 | ext. 6211 |

altonc@amneal.com

P

IMPORTANT NOTICE: The information contained in this electronic e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged. If any reader of this communication is not the intended recipient, unauthorized use, disclosure or copying is strictly prohibited, and may be unlawful. If you have received this communication in error, please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 20, 2016 Jul 20, 2016

Hi Lovekesh,

I am sending again with the two files in the cloud. Link below.

The Acrobat version used is X. (Patch update 10.0.0)

I didn’t try Acrobat DC, I will try.

https://files.acrobat.com/a/preview/71edb04b-c380-4982-9914-4e3746447bad

Thanks,

Alton Chung | Senior Regulatory Affairs Associate|

Amneal Pharmaceuticals |

41 Colonial Drive, Piscataway NJ 08854 |

732.645.3030 | ext. 6211 |

altonc@amneal.com

P

IMPORTANT NOTICE: The information contained in this electronic e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged. If any reader of this communication is not the intended recipient, unauthorized use, disclosure or copying is strictly prohibited, and may be unlawful. If you have received this communication in error, please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Jul 21, 2016 Jul 21, 2016

I checked the file you shared, it is partially OCRed. That's why you are getting the issue. We are trying to resolve it in latest Acrobat DC.

For now you can use the workaround available for this issue

1. Go to File> Save As Other> Image> (any Format)

2. Create new PDF from these images.. Create > PDF from File> select all these images

3. Run OCR now. It will work and OCR it completely

It may be an overhead but will resolve your issue as of now. Please feel free to ask anything you want.

Thanks.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 21, 2016 Jul 21, 2016

Hi Lovekesh,

Thanks for the solution. This actually worked and the resolution of the pages were not compromised much. I was also tried saving as postscript file and distilling to original document. All the pages were OCRed after ocr’ing.

Thanks again. I will reply to this email for future inquiries.

Thanks,

Alton Chung | Senior Regulatory Affairs Associate|

Amneal Pharmaceuticals |

41 Colonial Drive, Piscataway NJ 08854 |

732.645.3030 | ext. 6211 |

altonc@amneal.com

P

IMPORTANT NOTICE: The information contained in this electronic e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged. If any reader of this communication is not the intended recipient, unauthorized use, disclosure or copying is strictly prohibited, and may be unlawful. If you have received this communication in error, please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jul 26, 2016 Jul 26, 2016
LATEST

This is a test.

Thanks,

Alton

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines