I want to import an FDF file that contains Chinese characters into a PDF file, but the Chinese characters come out garbled after importing. I know that XFDF files support Chinese characters. Can anyone confirm whether FDF files do not support Chinese characters? Thanks.
Try this:
fill in the form with Chinese characters
export an FDF
close the PDF without saving
open the PDF
import the FDF
Does it work? I think it will work. However, you will have to match the encoding of the font in this particular PDF, which won't be anything as simple as Unicode. This may be so challenging that XFDF is a much better option.
Thanks for your response. Yes, it works. However, the Chinese characters still look garbled in the exported FDF file. Please see below.
What I need is to import an FDF created by Python. Do you have any idea which encoding I should use in the FDF? Thanks.
This looks like escaped UCS-2. This is standard, but no ordinary text editor will produce it mixed with non-Unicode content in this way; you need to write software.
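As a rough illustration of what "escaped UCS-2" means here, the following Python sketch encodes a value as a PDF-style literal string: a BOM, then big-endian 16-bit code units, with the bytes that have special meaning inside `( ... )` escaped. The function name `pdf_text_string` is made up for this example, and the escaping rules are those of PDF literal strings as I understand them, so treat it as a starting point rather than a verified recipe.

```python
def pdf_text_string(text: str) -> bytes:
    """Encode text as a PDF literal string: BOM + UTF-16BE, with escapes.

    Hypothetical helper for illustration; not part of any library.
    """
    raw = b"\xfe\xff" + text.encode("utf-16-be")
    out = bytearray(b"(")
    for byte in raw:
        if byte in b"()\\":
            # Parentheses and backslash delimit or escape literal strings,
            # so they must be preceded by a backslash.
            out += b"\\" + bytes([byte])
        elif byte in (0x0A, 0x0D):
            # Raw line-ending bytes may be normalised by a PDF reader,
            # so write them as octal escapes instead.
            out += b"\\%03o" % byte
        else:
            out.append(byte)
    out += b")"
    return bytes(out)


print(pdf_text_string("中文"))
```

Note that the escaping applies to bytes, not characters: a Chinese character such as "小" (U+5C0F) contains the byte 0x5C, which is a backslash and has to be escaped.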
Oh no. Do you mean that Adobe PDF only supports FDF with escaped UCS-2 encoding when the FDF contains Chinese characters? Thanks.
I think that must be correct. A better situation than I said originally. Is there a problem with this?
Yes, since I need to merge the FDF into the PDF automatically via Python instead of manually. It works fine for FDF without Chinese characters. Anyway, thank you so much for your responses.
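For generating the FDF from Python, one possible approach is to sidestep escaping entirely by writing each value as a PDF hex string (`<FEFF...>`) instead of a literal string. The sketch below builds a minimal FDF that way. The names `hex_text_string` and `build_fdf`, the field name, and the output filename are all made up for illustration, and I am assuming the viewer treats hex strings the same as literal strings, which PDF syntax allows but which has not been tested here against Acrobat.

```python
def hex_text_string(text: str) -> bytes:
    """Encode text as a PDF hex string: <FEFF...> holding BOM + UTF-16BE."""
    raw = b"\xfe\xff" + text.encode("utf-16-be")
    return b"<" + raw.hex().upper().encode("ascii") + b">"


def build_fdf(fields: dict) -> bytes:
    """Build a minimal FDF body from a {field name: value} mapping."""
    entries = b"\n".join(
        b"<< /T " + hex_text_string(name) + b" /V " + hex_text_string(value) + b" >>"
        for name, value in fields.items()
    )
    return (
        b"%FDF-1.2\n"
        b"1 0 obj\n"
        b"<< /FDF << /Fields [\n" + entries + b"\n] >> >>\n"
        b"endobj\n"
        b"trailer\n"
        b"<< /Root 1 0 R >>\n"
        b"%%EOF\n"
    )


if __name__ == "__main__":
    # "name" and "form_data.fdf" are placeholders for this example.
    with open("form_data.fdf", "wb") as handle:  # binary mode: no re-encoding
        handle.write(build_fdf({"name": "中文"}))
```

Writing the file in binary mode matters: opening it in text mode with some other encoding would re-encode the bytes and bring back the garbling.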
Tip: notice the first two characters in the string after the "(". They look similar to "by", but they are not those characters; they are "þÿ", that is, the bytes 0xFE 0xFF. This is the BOM (byte order mark), which signals that the characters that follow are UCS-2. This makes sense because the syntax of FDF objects is just the same as PDF.
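A small Python sketch of the tip above: the two BOM bytes show up as "þÿ" when viewed as Latin-1, and a value that starts with them can be decoded as big-endian 16-bit text. The helper name `decode_pdf_text` is made up, the input is assumed to be the string bytes after any backslash escapes have been undone, and the Latin-1 fallback is only a rough stand-in for PDFDocEncoding.

```python
BOM = b"\xfe\xff"

# Viewed as Latin-1, the BOM bytes look like "þÿ", which is what a text
# editor shows at the start of the string.
bom_as_latin1 = BOM.decode("latin-1")
print(bom_as_latin1)


def decode_pdf_text(raw: bytes) -> str:
    """Decode unescaped PDF string bytes into text (hypothetical helper)."""
    if raw.startswith(BOM):
        # BOM present: the rest is big-endian 16-bit Unicode.
        return raw[len(BOM):].decode("utf-16-be")
    # No BOM: approximate PDFDocEncoding with Latin-1 (an assumption).
    return raw.decode("latin-1")


print(decode_pdf_text(b"\xfe\xff\x4e\x2d\x65\x87"))
```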