Tagging issues when exporting from Word to PDF for accessibility
Hi all,
I often have to remediate PDF files for Section 508 compliance. They are created from Word documents that have been translated from English into Spanish.
Lately I have come across several issues with some of these files. I have no way of influencing the file design, so I basically have to convert what I am given from Word to PDF. I have tried two approaches: Save As with PDF selected as the file type, and the direct Save as PDF File option, which uses Adobe PDF Maker. The latter generally produces better results, but there are still many issues that should not really happen.
In today's file, these are some of the issues that came up:
1. Parts of the text are not tagged at all. These can be entire paragraphs or single lines, in the regular body or in text boxes. This file has a lot of footnotes, and much of the missing text occurs right after a footnote number, especially when a paragraph starts on one page, continues onto the next, and the footnote number appears on the second page. This happened several times in the file. The footnote links themselves are also missing in several instances. This is the most serious issue. I am dealing with a 377-page file this time. I found several instances of missing text and will try to remediate them by tagging manually, but this doesn't always work, and retagging is buggy as well. The bigger problem is that I can't possibly review every tag in the document to make sure nothing is missing. It would take ages.
2. The footnote numbers are linked to the actual footnotes using a LINK tag, but the OBJR tag is sometimes placed outside the LINK tag.
3. Manually tagging the missing text from item 1 above is extremely buggy. If I simply select the text and tag it as "P" using the Reading Order tool, some of the text goes missing, especially in areas with highlighted text. This document has a lot of highlighted text, which is not ideal, but I can't control that. When highlighted text is tagged this way, it seems to be "sent to the back".
4. In some paragraphs, for some reason, the P tag holds several identical containers that each seem to contain the affected text for that paragraph, instead of a single container.
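For item 1, one idea that might avoid reviewing every tag by hand is to compare the source text against the text that actually made it into the tags. The Python sketch below is only a rough illustration and rests on assumptions: that the Word document can be saved as plain text and split into paragraphs, and that the tagged content of the PDF can be exported as text that follows the tag tree (for example, an accessible-text export from Acrobat). The function names and the `min_length` threshold are hypothetical, and exact substring matching is strict, so some false positives are likely (footnote numbers or hyphenation that differ between the two exports, for instance).

```python
import re
import unicodedata


def normalize(text):
    """Unify Unicode forms, collapse whitespace, and lowercase the text."""
    text = unicodedata.normalize("NFKC", text)
    return re.sub(r"\s+", " ", text).strip().lower()


def find_missing_paragraphs(source_paragraphs, tagged_text, min_length=20):
    """Return (number, paragraph) pairs from the source not found in the tagged text.

    source_paragraphs: paragraphs taken from the Word file (assumed input).
    tagged_text: text exported from the PDF tag tree (assumed input).
    min_length: hypothetical cutoff; shorter fragments are skipped as unreliable.
    """
    haystack = normalize(tagged_text)
    missing = []
    for number, paragraph in enumerate(source_paragraphs, start=1):
        needle = normalize(paragraph)
        if len(needle) < min_length:
            continue
        if needle not in haystack:
            missing.append((number, paragraph))
    return missing


if __name__ == "__main__":
    # Made-up sample data, not taken from the real file.
    source = [
        "El primer párrafo aparece completo en las etiquetas del documento.",
        "El segundo párrafo continúa en la página siguiente después de la nota.",
    ]
    tagged = "El primer párrafo aparece completo\nen las etiquetas del documento."
    for number, paragraph in find_missing_paragraphs(source, tagged):
        print(f"Paragraph {number} may be untagged: {paragraph[:60]}")
```

A check like this would only point to paragraphs worth inspecting in the Tags panel. It would not fix the tagging itself or detect the misplaced OBJR tags from item 2.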
My goal is to get some guidance on how to fix the file in Word so that this doesn't happen, or to get the attention of someone on the development team so they can fix the way Adobe PDF Maker creates these files.
Thanks!
