Skip to main content
Participating Frequently
September 23, 2019
Question

Create PDF, why KANJI 9AD8(高) will be changed to 2FBC(⾼) when Meiryo UI ?

  • September 23, 2019
  • 3 replies
  • 15936 views

When I create PDF by Adobe Acrobat Distiller.

Acrobat changes KANJI 9AD8(高) to 2FBC(⾼) when Meiryo UI.

Then internet world, I can see many documents includes 2FBC(⾼).

Normally it is difficult to input character 2FBC(⾼) to documents.

This behavior is not convenient. We can not serch document include "高".

Could you teach Adobe company about this phenomenon.

Step1 Original Word document.

Step2 Acrobat PDF. I can not search Meiryo UI 6587.

Step3 Word PDF. I can search Meiryo UI 6587. No problem.

 

 

 

This topic has been closed for replies.

3 replies

assause
Community Expert
Community Expert
September 23, 2019

元のOSバージョン+作成アプリケーションと、Distillerのバージョン、そしてどのように変換を行ったのか、といった情報が必要にはなります。

 

ただ、Windows 10+Word 2016上で作成した「高い」という文字を含んだ文書を、Adobe PDFプリンタードライバー経由で標準設定で書き出したPDFからテキスト抽出したものをコード確認する限りは、u+9ad8となっていることを確認しました。

 

Participating Frequently
October 1, 2019
assause-さん ありがとうございます。ポイントはMeiryo UIフォントを使うことです。MS Gothicなどでは起きません。この問題が発生する原因はDistillerが利用しているライブラリーが関係します。EUC、SJIS、UTF16などの文字コード変換すると、Meiryo UIフォントではCJKの漢字と康煕字典部首コードの漢字が同じにリンクされているため、予期せぬ結果になります。康煕字典部首コードにリンクしていないMS Gothicなどでは、この問題は起きません。
assause
Community Expert
Community Expert
October 2, 2019

改めて行ってはみたのですが、u+2ad8がPDFにした際にu+9fdcに統合される、という現象にはなりました。

いくつかのフォントを用いましたが、いずれも同様です。

実際にテストしたデータを添付しておきます。

 

 

Legend
September 23, 2019

Distiller does not understand CJK remapping, it just takes its input and makes a PDF. So we need to look closely at all the steps and settings that you use on the way to the PDF. I checked the Meiryo UI font included with Windows 8.1, and it does include U+9AD8.

An interesting point is that Chrome shows both of your code points as identical 

while some pages show different eg

(Key point for me: Is the low centre box detached?)

Participating Frequently
September 25, 2019

Thank you for Test_Screen_Name-san. Distiller does not have remapping to CJK, of course.

But, some application had the function that use first code than large code in KANJI code. Because KANJI code has simple(current) style code and difficult(old) style code. For example 

4E80(亀and 9F9C(). Two KANJI character has the same mean KAME=Turtle. This function select 4E80 than 9F9C, because user should chose current style code. But Meiryo has more more first code 2FD4(⿔), so this phenomenon occurs, If disttller application codes includes this function.

Participating Frequently
September 25, 2019
Since you have Acrobat, I assume Acrobat DC, please convert with the Acrobat ribbon in Microsoft Word. This does not use Distiller and should get much better results.
Thank you Test_Screen_Name-san. Of course, I don't use Distiller, then I get best results. Only Distiller has been sprinkling dirty characters.
ls_rbls
Community Expert
Community Expert
September 23, 2019

What is the original document created with before you convert to PDF?

Participating Frequently
September 25, 2019
Thank you for ls_rbls-san. I used Office 365 word. This problem is appear at Meiryo UI font, not appear at MS UI Gothic.