Hi,
Please read the release note for Unicode 4.0.1 closely.
This is the FIRST major revision of the Unihan database
since Unicode 3.2. Other changes worth noting.
Cheers,
- Ira
Ira McDonald (Musician / Software Architect)
Blue Roof Music / High North Inc
PO Box 221 Grand Marais, MI 49839
phone: +1-906-494-2434
email: imcdonald at sharplabs.com
-----Original Message-----
From: Patrik Fältström [mailto:paf at cisco.com]
Sent: Wednesday, March 31, 2004 11:02 AM
To: ietf-charsets at iana.org
Subject: Fwd: Unicode 4.0.1 Released
Begin forwarded message:
> From: Rick McGowan <rick at unicode.org>
>> Unicode 4.0.1 has been released! The data files and documentation are
> final and posted on the Unicode site. For details, see the version
> page for
> Unicode 4.0.1 at:
>http://www.unicode.org/versions/Unicode4.0.1/>> Unicode 4.0.1 is an update version of the Unicode Standard. It adds no
> new
> characters. The updated Unicode Character Database files for this
> version
> are available in the 4.0-Update1 directory:
>http://www.unicode.org/Public/4.0-Update1/>> For the unchanged files, see Unicode 4.0.0:
>http://www.unicode.org/versions/Unicode4.0.0/>> The book publication, The Unicode Standard, Version 4.0, together with
> this specification and the online Unicode Standard Annexes and the
> Unicode
> Character Database, define Version 4.0.1 of the Unicode Standard. The
> book
> gives the general principles, requirements for conformance, and
> guidelines
> for implementers, followed by character code charts and names. This
> book
> can be ordered online. Additional characters, clarifications, and
> errata
> are covered in this document.
>> The main new features in Unicode 4.0.1 are the following:
>> 1. The first significant update of the Unihan Database (Unihan.txt)
> since Unicode 3.2.0, including a large number of fixes and
> additional data items.
>> 2. Significant clarifications in four definitions used in conformance.
>> 3. Unicode Character Database:
> * New character properties: STerm and Variation_Selector
> * Updated significantly: Terminal_Punctuation, Math,
> Script, and Line_Break
> * Changed: general category of U+200B ZERO WIDTH SPACE
> * Changed: bidi class of several characters
> * Added: property value aliases
> * Revised: formats in some of the data files
>> 4. Changes in the recommended loose comparison of Character name
> values.
>> 5. Clearer definition of the encoding of Bengali Reph and Ya-phalaa
>