Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
0

Acrobat Autotagging for a document in Spanish removing accented characters from tags/content panel

New Here ,
Oct 23, 2024 Oct 23, 2024

Hi, my job wants me to tag a pdf file for accessibility for a document in Spanish. I've converted the document from word to pdf using the Acrobat extension on word.

 

My issue occurs when autotagging the the document in Acrobat. When I use the autotag, it removes any accented characters of the letters o and u (ó and ú) from the tags and the content panel. It only does this for the letters ó and ú. The letters á, é, and í are not impacted by autotagging. For instance, the Spanish word "Protección" shows up as "Proteccin" in the tags/content panels. This does NOT impact the actual PDF text itself, as it correctly shows the accented characters.

 

It also specifically only does this for autotagging, as the tags that are initally created when first converting from word to pdf do have the accents in the tags/content panels. However, I'd highly prefer to use the autotag function as it would save me hours of work for this document.

 

The only workaround I've found is to edit the pdf text by removing and readding the impacted characters, but this would also be a slow process.

 

I have made sure to:

 

- embed fonts when converting word to pdf

- change the language in acrobat to spanish

 

My Acrobat Pro version is currently 24.003.20180.

 

Any help would be appreciated.

TOPICS
General troubleshooting , Standards and accessibility
1.0K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Explorer ,
Oct 23, 2024 Oct 23, 2024

What font are you using?  Maybe the Unicode mapping is not complete and does not include those characters. Try using a standard Windows font like Arial and see if the results change.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 24, 2024 Oct 24, 2024

I was using Calibri. I made a version in Arial on your suggestion but the problem was also present on the Arial version.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Explorer ,
Oct 24, 2024 Oct 24, 2024

Sorry to hear that. Can you upload the file? Acrobat stores language information in the File properties and also in the Tags and content. This can create some problems depensing on how the file was created but this is a new one on me. While I doubt i could fix this but I'd like to see whats going on. Understand if you can't share the file. If you hapoen tp have Axes PDF Quick Fix removing all language settings from tags and content might change things. IDK

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Oct 25, 2024 Oct 25, 2024
LATEST

I can't upload the file, but I can upload this sample I just made that has the same issue. The attached pdf was autotagged, resulting in the missing ó and ú letters in the tags content panel.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines