Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Reading/Optimizing scanned pages in a Batch Process to convert into readable format using IAC or plugins in VBA/ Python or VB.Net

Guest
May 30, 2017 May 30, 2017

I'm working on an automation project which requires me to read the contents of the pdf. Some pages have partial content in the form of readable content(PDF) whereas partial content is in the form of scanned images. I need to write a code in python/VBA/VB.Net using any of the inbuilt functionality of Pro DC such as API, functions or commands. One such I came across named as AVDocSaveOptimized but I am not sure whether it optimizes scanned pages or not? If anyone can help in anyway or if someone has a direct code to do so?

TOPICS
Acrobat SDK and JavaScript
996
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
May 30, 2017 May 30, 2017

Is this for a server? acrobat is a personal tool with some very light duty interactive automation.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guest
May 30, 2017 May 30, 2017

No, not for a server. Any help?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
May 30, 2017 May 30, 2017

I can tell you the method you mentioned does not do OCR. You need to use the AVCommand API, which is very poorly documented.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Guest
May 31, 2017 May 31, 2017

I am already using the AV Command API.

LeT me give you my precise piece of code so that you may be able to help me in a better way.

Last 2 lines of code in python:

avDoc1 = Dispatch("AcroExch.App")
scanok=avDoc1.MenuItemExecute("Cpt:CapturePages")

Now, when I run this code, it opens up the Pro DC and lands on the specific page where I have the button "recognize text". All I need to do is just write some more line of command which will help me to shift the command to this button and press it so that this operation(recognizing text) gets started and I have the desired output pdf file directly opened.

OR

If you can tell me precisely which command to use in AV Command API to operate recognize text button, it will be a really great help which will solve my problem.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
May 31, 2017 May 31, 2017
LATEST

That is not the AVCommand API. It is not even a plug in! You will not use one API but a number of them, after research into names and settings. Expect at least a month of study and experiment if you are an experienced C programmer.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines