MathGroup Archive 2005

[Date Index] [Thread Index] [Author Index]

Search the Archive

Re: Unicode Support

  • To: mathgroup at smc.vnet.net
  • Subject: [mg55514] Re: Unicode Support
  • From: dh <dh at metrohm.ch>
  • Date: Sun, 27 Mar 2005 02:42:49 -0500 (EST)
  • References: <d234el$lid$1@smc.vnet.net>
  • Sender: owner-wri-mathgroup at wolfram.com

Hello Zhu,
Unicode makes a distinction between characters (code points) and 
codeing. While there are always hex 10FFFF code points, there are 3 
different codings used: UTF-8,UTF-16,UTF-32. All these codings are 
"lossless" (invertible, ont-to-one). UTF-8 uses 1,2,3 or 4 Bytes per 
character. UTF-16 2 to 4 bytes. UTF-32 obviously 4 bytes.
Therefore, there is no new Unicode Standard with 32 bit.

Daniel

Zhu Chongkai wrote:
> Hi all,
> 
> The Mathematica Book says that Mathematica support Unicode Characters.
> And the MathLink tells that a Unicode character in Mathematica is a
> 16-bit. But the latest Unicode Standard uses 32-bit to encode a
> character. It seems to me that Mathematica's Unicode support is
> outdated, based on an old version of Unicode Standard, which only
> contains lass than 65536 characters. Will next version of Mathematica
> use 32-bit encoding? Or am I wrong?
> 
> Cheers,
> Zhu Chongkai
> http://www.neilvandyke.org/mrmathematica/
> 


  • Prev by Date: Re: Upgrading to v5.1?
  • Next by Date: Re: Writing result of "Do" in a single List
  • Previous by thread: Unicode Support
  • Next by thread: Re: Unicode Support