Skip to main content
Participating Frequently
January 18, 2024
해결됨

Character encoding error in just some of several similar PDF files

  • January 18, 2024
  • 1 답변
  • 4318 조회

Hello community!

I hope that you can help me with an error I can't understand. I have two different PDF files, but created in the same way. One of them gets a character encoding error on a Heading Level 2 where there is a number in the text, but not the other one.

The one with the error is called landskronaweb.pdf and the one that passes Acrobat accessibility check is lommaweb.pdf.
I would appriciate any input of what the difference is between the files because I can't find it.
Thanks!

 

이 주제는 답변이 닫혔습니다.
최고의 답변: try67

Thank you! I appriciate it so much!! I hade no idea that you could see that in a PDF, can you see the unvalid characters in Acrobat?
I have created the files with CeTe DynamicPDF, is that what you mean by library?

 


Not directly. I used a script I wrote to print out all the text, including any hidden characters, and then it showed up. Something similar you can do, though, is to copy that text and then paste it into a plain-text editor and you'll see the "1" appears as a square symbol.

 

Yes, "CeTe DynamicPDF" is the library/application that needs to be checked for a solution to this issue. Another option can be the font that was used ("Bitter Pro"). Maybe it has the wrong encoding for this character... You can try using a different font and see if the issue still happens.

1 답변

try67
Community Expert
Community Expert
January 18, 2024

Where do you see this error message, exactly?

Hanna Lou작성자
Participating Frequently
January 18, 2024

When I run the Accessibility check in Acrobat, I get two errors on that file and the marker highlights the number in the Heading. I took a screenshot of the error and attach it here so you can see, it's in the Swedish version of Acrobat but "Teckenkodning" is Character Encoding.

 

try67
Community Expert
Community Expert
January 18, 2024

There are indeed some weird characters around that "1" in the title, namely:

LANDSKRONA NO \uDBC1\uDC2E Axelro, 

They don't appear to be valid Unicode characters, hence the error you're getting. This is an issue with the library that created the PDF, most likely.