New character encoding
I've invented a Character encoding.
Advantages of this: ANSI ASCII compatible, Bitwise operations based, Self-synchronising, Abundant.
Yields of this encoding, against those of UTF-8, are,
Number of Bits This Encoding UTF-8
Number of Codes Accumulation Number of Codes Accumulation
8 128 128 128 128
10 192 320 0 128
12 512 832 0 128
14 1280 2112 0 128
16 3072 5184 2048 2176
18 7168 12352 0 2176
20 16384 28736 0 2176
22 36864 65600 0 2176
24 81920 147520 65536 67712
26 180224 327744 0 67712
28 393216 720960 0 67712
30 851968 1572928 0 67712
32 1835008 3407936 2097152 2164864
This is a new Character encoding scheme (CES) that maps Unicode code points to bit sequences.
Could you please suggest improvements?
Please bear with me as the table may not be formatted well for you, especially when using serif. When reading a line in the table, the first value is number of bits and the next pair is for this encoding, and the other pair is for UTF-8, with the first of each pair being the number of codes and the others are their accumulation.
Sorry! I wish to provide more details on this but I'm restricted for some time. I hope that this does not stop you from assisting me.
Regards
Anbu
This encoding maintains almost all the properties of UTF-8 in a more compact format. ANSI ASCII compatiblity, Bitwise operations based, Self-synchronising and Abundance are some of the properties of this encoding. Further, this encoding encodes all characters in far fewer number of bits than UTF-8 as shown in the table. As I had mentioned earlier I will provide more details and proofs soon. This post is a request for suggestions. Please suggest the most suitable place for this post.