• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
0

Losing characters when I try to convert from PDF to other file extensions

New Here ,
Oct 17, 2024 Oct 17, 2024

Copy link to clipboard

Copied

I am facing the problem. I have my PDF file, and I need it to convert to HTML. I am from Serbia, and in my PDF there are special characters like ć.č,đ,š,ž. When i convert to HTML or Word file, Instead, I get characters like {,^,~,},`, Could someone know the fix for this? Thanks

TOPICS
Create PDFs , Edit and convert PDFs , How to , PDF , PDF forms

Views

58

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
Adobe Employee ,
Jan 16, 2025 Jan 16, 2025

Copy link to clipboard

Copied

LATEST

Hi there

 

I hope you are doing well, and I'm sorry to keep you waiting.

 

It seems like the issue you're experiencing is related to encoding. When converting a PDF with special characters (such as ć, č, đ, š, ž), the software might not properly handle the character encoding, resulting in incorrect symbols.

 

Here are some steps you can try :

- Open the PDF in Acrobat.
- Go to File > Export To > HTML Web Page or Microsoft Word. In the export settings, make sure the language is set to Serbian or a compatible language that includes special characters.
- Export the file and check the output.

You may also check Font Embedding in the PDF:

- Open the PDF in Acrobat.
- Go to File > Properties > Fonts and check if the fonts used in the document are embedded. If they’re not, the characters might not map correctly during conversion. Try embedding the fonts and exporting again.

Also, check your system locale and language:

- Ensure your computer’s system locale is set to support Serbian characters:
Windows: Go to Control Panel > Region > Administrative > Change system locale, and select Serbian.
Mac: Check System Preferences > Language & Region.


If the PDF contains scanned images or text that isn't selectable, use an OCR tool that supports Serbian characters. Adobe Acrobat's OCR feature supports multiple languages and can help recognize special characters during conversion.

Let us know how it goes.

 

 

Regards
Amal

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines