Retrieve text and image alt-text for read-aloud feature

Community Beginner, Jun 29, 2023

Hi guys

I am currently working on a Java project that requires a read-aloud feature for PDF documents. The PDFs I'm dealing with include images with alt-text. My goal is to extract both the text and the alt-text from the PDF while preserving the correct reading order, so that the result can drive the read-aloud functionality.

 
To accomplish this, I would appreciate your guidance on the following:
1. Extracting the text from the PDF while preserving the reading order that I've set using Adobe Acrobat.
2. Extracting the alt-text associated with the images in the PDF, also following the correct reading order.
3. Combining the extracted text and alt-text in the reading order I set with Acrobat's accessibility tools, to generate the content for a text-to-speech system.
 
I am using the pdfservices-sdk.
Regards
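
The three steps listed in the question can be sketched independently of any particular extraction library. The example below is a minimal illustration, not the SDK's API: the `Element` record, its `path`/`text`/`altText` fields, and the `buildScript` method are all hypothetical names, and the input list is assumed to already be in the tagged reading order.

```java
import java.util.ArrayList;
import java.util.List;

/** Sketch: turn tagged PDF content into an ordered read-aloud script. */
class ReadAloudSketch {

    /**
     * Hypothetical model of one extracted element. `path` mimics a tag path
     * such as "//Document/P" or "//Document/Figure"; `text` and `altText`
     * may be null when the element carries none.
     */
    record Element(String path, String text, String altText) {
        boolean isFigure() {
            return path != null && path.contains("/Figure");
        }
    }

    /**
     * Walks the elements in the order given (assumed to already be the
     * tagged reading order) and emits at most one utterance per element.
     */
    static List<String> buildScript(List<Element> elements) {
        List<String> script = new ArrayList<>();
        for (Element e : elements) {
            if (e.isFigure()) {
                // Figures are spoken only if they carry alt-text.
                if (e.altText() != null && !e.altText().isBlank()) {
                    script.add("Image: " + e.altText().trim());
                }
            } else if (e.text() != null && !e.text().isBlank()) {
                script.add(e.text().trim());
            }
        }
        return script;
    }

    public static void main(String[] args) {
        // Made-up sample data for illustration only.
        List<Element> elements = List.of(
            new Element("//Document/H1", "Quarterly report", null),
            new Element("//Document/Figure", null, "Bar chart of sales by region"),
            new Element("//Document/P", "Sales rose in every region.", null));
        buildScript(elements).forEach(System.out::println);
    }
}
```

Keeping text and alt-text in a single ordered list means the text-to-speech system only has to read the script top to bottom; the hard part, which this sketch does not solve, is obtaining the elements in the tagged order in the first place.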
Adobe Employee, Jun 30, 2023

Have you looked at/tried the Extract API? I'm not sure about alt-text for images, but in general, Extract gets _everything_ out.
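
For anyone wanting to check what Extract returns for a given file: the result is delivered as a ZIP archive containing a `structuredData.json` file. The sketch below uses only the Java standard library to pull that JSON out for inspection. The class name, method name, and the `extract-output.zip` path are placeholders, and whether alt-text shows up in the JSON for a particular PDF is exactly what this is meant to help verify, not something the code assumes.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Path;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

/** Sketch: read structuredData.json out of a saved Extract result ZIP. */
class ExtractResultReader {

    /** Returns the contents of structuredData.json as a UTF-8 string. */
    static String readStructuredData(Path zipPath) throws IOException {
        try (ZipFile zip = new ZipFile(zipPath.toFile())) {
            ZipEntry entry = zip.getEntry("structuredData.json");
            if (entry == null) {
                throw new IOException("structuredData.json not found in " + zipPath);
            }
            try (InputStream in = zip.getInputStream(entry)) {
                return new String(in.readAllBytes(), StandardCharsets.UTF_8);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        // "extract-output.zip" is a placeholder; pass the real path as an argument.
        Path zipPath = Path.of(args.length > 0 ? args[0] : "extract-output.zip");
        String json = readStructuredData(zipPath);
        System.out.println(json.length() + " characters of JSON");
    }
}
```

Searching the returned JSON for the figure elements of a test PDF is a quick way to see which attributes, if any, carry the alt-text.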

Community Beginner, Jul 04, 2023

Hi Raymond

Thanks for your response. I did use the Extract API and tried setting ExtractPDFOptions, but I don't see ExtractElementType.ALT (or anything equivalent). The result I got did not contain all of the text and alt-text in the reading order.

Please give me some hints.

Thanks

 

Adobe Employee, Jul 10, 2023

Can you share a PDF w/ images that make use of alt text? If it's private, DM me. 
