Re: Unicode Support
- To: mathgroup at smc.vnet.net
- Subject: [mg55514] Re: Unicode Support
- From: dh <dh at metrohm.ch>
- Date: Sun, 27 Mar 2005 02:42:49 -0500 (EST)
- References: <d234el$lid$1@smc.vnet.net>
- Sender: owner-wri-mathgroup at wolfram.com
Hello Zhu, Unicode makes a distinction between characters (code points) and codeing. While there are always hex 10FFFF code points, there are 3 different codings used: UTF-8,UTF-16,UTF-32. All these codings are "lossless" (invertible, ont-to-one). UTF-8 uses 1,2,3 or 4 Bytes per character. UTF-16 2 to 4 bytes. UTF-32 obviously 4 bytes. Therefore, there is no new Unicode Standard with 32 bit. Daniel Zhu Chongkai wrote: > Hi all, > > The Mathematica Book says that Mathematica support Unicode Characters. > And the MathLink tells that a Unicode character in Mathematica is a > 16-bit. But the latest Unicode Standard uses 32-bit to encode a > character. It seems to me that Mathematica's Unicode support is > outdated, based on an old version of Unicode Standard, which only > contains lass than 65536 characters. Will next version of Mathematica > use 32-bit encoding? Or am I wrong? > > Cheers, > Zhu Chongkai > http://www.neilvandyke.org/mrmathematica/ >