"Correct recognized text" displays gibberish when working with Japanese

Community Beginner
Nov 07, 2023

I am using Acrobat Pro for macOS, version 2023.006.20360, on a 2021 14-inch MacBook Pro with an Apple M1 Pro chip. The OS is macOS Sonoma 14.0.

I want to correct the OCR-recognized text of a scanned Japanese document. Acrobat has successfully recognized 99% of the Japanese text, but when I run "Correct recognized text," the text in the "Recognized as" field of the pop-up window displays as classic "mojibake" gibberish. Here's a concrete example. The actual Japanese text should be 中公新書. Acrobat successfully recognizes the first three characters but misinterprets the last character as ",'}." This is easily confirmed by copying the recognized text into a word processor. Yet the pop-up window in "Correct recognized text" displays the recognized text as "�����s�A.�b". All the recognized Japanese text is displayed this way, so Acrobat's "Correct recognized text" function is useless for Japanese. There's no way to tell from the displayed results whether the text was recognized correctly; I can only check by copying all the text and pasting it into a word processor, which makes correcting the text extremely difficult. Although the attached image shows the app running in English, the same thing happens when I switch the app language to Japanese.

Screenshot 2023-11-07 at 17.15.39.png
