Duplicate :
|
Name: bb33257 Date: 10/26/99 Unicode is more than just wide ASCII. One of the principal operations on Unicode is to normalize text. This is often known as text "decomposition" or "composition". In decomposition, composed accented characters such as "��" are converted into two characters, in this case "o" followed by a separate umlaut character. In composition, the reverse occurs. This is described in UTR #15. A limited normalizer class is a part of JDK 1.1, but it is not exposed to clients. This class needs to be enhanced and made public. XML, JavaScript, and other standards will require this capability. (Review ID: 97055) ======================================================================