Supporting Southeast Asian Languages, Optional ICU Support?
Greetings!
I'm working in Cambodia currently and I have worked in Laos too. I lived in Thailand for a while and helped a friend with a Burmese project. I created one of the first Lao Android apps that didn't use Zero-Width Spaces (ZWSPs) for content. I previously worked in a team that helped supply the International Components for Unicode (ICU) with the data they needed to provide word segmentation (automatic line-breaking).
I enjoy InDesign! It's easy to use and gets the job done!
Well... Except in Lao, Khmer, Thai, and Burmese... (the total of which is a population of about 145 million people or so, not to mention those who've moved to other countries)
Adobe seems to be using ICU in many of its products (it's on the list of ICU's users). But, for some reason, intentional or unintentional - ICU word segmentation isn't turned on for these 4 major SEA languages. Even a system option to enable it deep within preferences would be cool.
So, thus my question...
What do I need to do in order to create a plugin, in C++, which imports ICU's libraries, parses the text with ICU to segment the words, and then adds in ZWSPs?
Or better yet, if InDesign is already using ICU on the backend, can one of you wonderful, delightful, amazing Adobe developers please provide a way to turn on SEA word segmentation in that Adobe World-Ready Paragraph Composer??? I'll buy you a pizza. Just imagine, you could be eating pizza right now... Yes... Yes, you could... PIZZA ![]()
And if you do, my wonderful adobe developer friend, you might want to provide a mechanism to disable it - because I know a lot of people who prefer ZWSPs because they like editing hidden characters in their text for someone crazy reas... *cough* I mean, they prefer finer-grained control. ![]()
Here's what the word God (ພຣະອົງ) looks like when He's been chopped in half:
The transliteration of this split Lao word (highlighted) would be like"pr" (on the end of the line) and "aong" on the second.
It would be like me saying:
Th
ank you!
Remember... There's pizza involved.
