Copy link to clipboard
Copied
I am exporting a tagged PDF to HTML using Adobe Acrobat, and the process works as expected. The resulting HTML stores the images from the document in a separate folder. However, the issue lies with the alt attribute in the <img> tags. All the <img> tags have the same alt text: alt="image".
<img width="718" height="403" alt="image" src="mytest_files/Image_001.png" />
<img width="718" height="403" alt="image" src="mytest_files/Image_002.png" />
Why doesn't the converted HTML retain the original alternate text from the PDF? Is there a way to preserve the alternate text in the exported document?
Copy link to clipboard
Copied
Copy link to clipboard
Copied
All the <img> tags have the same alt text: alt="image".
By @dpbhatt<img width="718" height="403" alt="image" src="mytest_files/Image_001.png" /> <img width="718" height="403" alt="image" src="mytest_files/Image_002.png" />
In addition, alternative text should accurately describe the content or purpose of the image for individuals who cannot see it. Adding alt="image" is useless to those who need it.
Jane
Copy link to clipboard
Copied
Hello,
I hope you're doing well, and we apologize for the delayed response and the trouble.
Preserving alternate text (alt text) from a PDF when converting it to an HTML document can be challenging, as not all conversion tools retain this information. Alt text is crucial for accessibility, providing descriptions of images and other non-text elements for screen readers.
Steps to Preserve Alt Text During PDF to HTML Conversion:
Ensure Proper Tagging in the Original PDF:
Use Adobe Acrobat's Export Feature:
File
> Export To
> HTML Web Page
.Verify Alt Text in the HTML Output:
alt
attributes are present for image tags.Additional Considerations:
Limitations of Conversion Tools:
Manual Adjustments:
alt
attributes for each image tag.For more information, see this article: https://adobe.ly/4hllAuw
I hope this helps.
Thanks,
Anand Sri.