Character encoding issues when a document is autotagged
Recently, whenever I autotag a document, I end up with character encoding issues. They don't always show up as a failure in the accessibility checker, though. Sometimes letters just *disappear*: I can see them on the page, but they're no longer in the content containers when I check the tag tree, and a screen reader doesn't voice them. Some examples are "refective, fltered, beneft, specifc, defned" (for reflective, filtered, benefit, specific, defined). I'm not sure why some seemingly random characters go missing.

This has happened with several PDFs, and I don't have access to the source documents. When I check character encoding before autotagging, no issues come up. In most cases I have to autotag, though, because the documents have no space at the end of each line of text within a paragraph, so the last word of one line and the first word of the next get smooshed together. Autotagging is the only way I've found to fix that.

Does anyone know why this is happening, or whether I can fix it? Also, is there any way to catch it early on, rather than while listening to the document after fully remediating it? Thank you.
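On catching it early: every example above is missing the letter right after an "f" (an "i" or an "l"), which is the pattern you would expect if "fi"/"fl" ligature glyphs were being mapped to a plain "f". That cause is only a guess. Assuming it holds, here is a rough Python sketch that scans text copied out of the tagged PDF and flags words that become real words once a letter is put back after an "f". The function name `find_dropped_ligatures` and the `vocabulary` word list are hypothetical; you would supply your own word list and your own way of getting the text out.

```python
import re

# Letters that follow "f" in common ligature glyphs (fi, fl, ff, ffi, ffl).
# Assumption: the "f" survives and the rest of the ligature is lost,
# as in the examples from the post.
LIGATURE_TAILS = ("i", "l", "f", "fi", "fl")


def find_dropped_ligatures(text, vocabulary):
    """Return (found_word, likely_original) pairs for words that look
    like they lost the letters after an "f"."""
    vocab = {w.lower() for w in vocabulary}
    suspects = []
    for word in re.findall(r"[A-Za-z]+", text):
        lower = word.lower()
        if lower in vocab:
            continue  # already a known word, nothing to flag
        for pos, ch in enumerate(lower):
            if ch != "f":
                continue
            # Try putting each ligature tail back after this "f".
            for tail in LIGATURE_TAILS:
                candidate = lower[:pos + 1] + tail + lower[pos + 1:]
                if candidate in vocab:
                    suspects.append((word, candidate))
                    break
            else:
                continue  # no tail matched at this "f", try the next one
            break  # one suggestion per word is enough
    return suspects


if __name__ == "__main__":
    words = ["reflective", "filtered", "benefit", "specific", "defined"]
    sample = "A refective, fltered beneft"
    for found, likely in find_dropped_ligatures(sample, words):
        print(f"{found!r} may have been {likely!r}")
```

This only catches words that are in the word list you give it, so it is a quick screen to run right after autotagging, not a replacement for listening to the document.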
