Exit
  • Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
  • 한국 커뮤니티
8

Character encoding error in just some of several similar PDF files

Community Beginner ,
Jan 18, 2024 Jan 18, 2024

Hello community!

I hope that you can help me with an error I can't understand. I have two different PDF files, but created in the same way. One of them gets a character encoding error on a Heading Level 2 where there is a number in the text, but not the other one.

The one with the error is called landskronaweb.pdf and the one that passes Acrobat accessibility check is lommaweb.pdf.
I would appriciate any input of what the difference is between the files because I can't find it.
Thanks!

 

TOPICS
Create PDFs , Standards and accessibility
2.7K
Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
2 ACCEPTED SOLUTIONS
Community Expert ,
Jan 18, 2024 Jan 18, 2024

There are indeed some weird characters around that "1" in the title, namely:

LANDSKRONA NO \uDBC1\uDC2E Axelro, 

They don't appear to be valid Unicode characters, hence the error you're getting. This is an issue with the library that created the PDF, most likely.

View solution in original post

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 19, 2024 Jan 19, 2024

Not directly. I used a script I wrote to print out all the text, including any hidden characters, and then it showed up. Something similar you can do, though, is to copy that text and then paste it into a plain-text editor and you'll see the "1" appears as a square symbol.

 

Yes, "CeTe DynamicPDF" is the library/application that needs to be checked for a solution to this issue. Another option can be the font that was used ("Bitter Pro"). Maybe it has the wrong encoding for this character... You can try using a different font and see if the issue still happens.

View solution in original post

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 18, 2024 Jan 18, 2024

Where do you see this error message, exactly?

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jan 18, 2024 Jan 18, 2024

When I run the Accessibility check in Acrobat, I get two errors on that file and the marker highlights the number in the Heading. I took a screenshot of the error and attach it here so you can see, it's in the Swedish version of Acrobat but "Teckenkodning" is Character Encoding.

 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 18, 2024 Jan 18, 2024

There are indeed some weird characters around that "1" in the title, namely:

LANDSKRONA NO \uDBC1\uDC2E Axelro, 

They don't appear to be valid Unicode characters, hence the error you're getting. This is an issue with the library that created the PDF, most likely.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jan 19, 2024 Jan 19, 2024

Thank you! I appriciate it so much!! I hade no idea that you could see that in a PDF, can you see the unvalid characters in Acrobat?
I have created the files with CeTe DynamicPDF, is that what you mean by library?

 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Expert ,
Jan 19, 2024 Jan 19, 2024

Not directly. I used a script I wrote to print out all the text, including any hidden characters, and then it showed up. Something similar you can do, though, is to copy that text and then paste it into a plain-text editor and you'll see the "1" appears as a square symbol.

 

Yes, "CeTe DynamicPDF" is the library/application that needs to be checked for a solution to this issue. Another option can be the font that was used ("Bitter Pro"). Maybe it has the wrong encoding for this character... You can try using a different font and see if the issue still happens.

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jan 19, 2024 Jan 19, 2024

Ok thanks! I checked the string now in my .NET code to see if the string was unvalid unicode before putting it on the PDF but they're both valid without strange characters, so maybe I have to check with CeTes support and see if they can help me. I will try a different font as well. New error for me so maybe it's a font issue. Thank you so much for your valuable help!!

 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Community Beginner ,
Jan 26, 2024 Jan 26, 2024
LATEST

For info, CeTe has tried BitterPro and there is a problem with Bitter Pro encoding numbers. I've changed the  font now to a better working.

So greatful for the help I got here!

 

Translate
Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines