JDK-4176141 : ja collation sequence fails with Unicode 2.1.2 decomposition data
  • Type: Bug
  • Component: core-libs
  • Sub-Component: java.text
  • Affected Version: 1.2.0
  • Priority: P4
  • Status: Resolved
  • Resolution: Fixed
  • OS: generic
  • CPU: generic
  • Submitted: 1998-09-24
  • Updated: 2002-10-07
  • Resolved: 2002-10-07
The Version table provides details related to the release that this issue/RFE will be addressed.

Unresolved : Release in which this issue/RFE will be addressed.
Resolved: Release in which this issue/RFE has been resolved.
Fixed : Release in which this issue/RFE has been fixed. The release containing this fix may be available for download as an Early Access Release or a General Availability Release.

To download the current JDK release, click here.
Other
1.4.2 mantisFixed
Related Reports
Relates :  
Description

Name: bb33257			Date: 09/24/98


RuleBasedCollator fails to build the Japanese collation sequence 
using Unicode 2.1.2 database.  The problem is that the decomposition 
table (Normalizer) is decomposing math symbols in the Japanese
sequence into special symbols.  These special symbols need to be 
quoted to be understood by the parser.  For example, \u2260 (Not 
Equal To) is decomposed to \u003D (=) and \u0338 (overlay combining
mark).
The fix is to decompose the characters when the pattern entries
are built.  This would require each pattern entry to call Normalizer.
decompose().  There may be performance issues involved.
======================================================================

Comments
CONVERTED DATA BugTraq+ Release Management Values COMMIT TO FIX: mantis FIXED IN: mantis INTEGRATED IN: mantis mantis-b03
14-06-2004

WORK AROUND Name: bb33257 Date: 09/24/98 Delete the math symbols from the Japanese collation sequence. ======================================================================
11-06-2004

EVALUATION In JDK 1.2fcs, we don't use Unicode 2.1.x-based Normalizer tables yet, so this is not an issue yet. norbert.lindenberg@Eng 1998-09-25
25-09-1998