JDK-6795538 : Inconsistences in case of charset names
  • Type: Bug
  • Component: core-libs
  • Sub-Component: java.nio.charsets
  • Affected Version: 6
  • Priority: P4
  • Status: Open
  • Resolution: Unresolved
  • OS: windows_xp
  • CPU: x86
  • Submitted: 2009-01-20
  • Updated: 2011-02-16
Description
FULL PRODUCT VERSION :
java version "1.6.0_03"
Java(TM) SE Runtime Environment (build 1.6.0_03-b05)
Java HotSpot(TM) Client VM (build 1.6.0_03-b05, mixed mode)

ADDITIONAL OS VERSION INFORMATION :
Windows XP SR-2

A DESCRIPTION OF THE PROBLEM :
1.) In sun.nio.cs.standard-charsets canonical name is defined as  "X-UTF-32BE-BOM", but in jdk1.6.0_11/docs/technotes/guides/intl/encoding.doc.html it is defined by lowercase 'x'.
2.) In sun.nio.cs.standard-charsets canonical name is defined as "X-UTF-32LE-BOM", but in jdk1.6.0_11/docs/technotes/guides/intl/encoding.doc.html it is defined by lowercase 'x'.
3.) In sun.nio.cs.standard-charsets aliases are defined as "cpxxx", but in jdk1.6.0_11/docs/technotes/guides/intl/encoding.doc.html they are defined by uppercase "Cpxxx".
4.) In http://www.iana.org/assignments/character-sets names are often defined by uppercase letters, but not in sun.nio.cs.standard-charsets or sun.nio.cs.ext.ExtendedCharsets. Examples: , CP273
5.) In  jdk1.6.0_11/docs/technotes/guides/intl/encoding.doc.html historical name is "JIS_X0201", but in sun.io it's "ByteToCharJIS0201", and in sun.nio.cs.JIS_X_0201 it's "JIS0201" and in sun.io.CharacterEncoding it's completely missing

These all are only examples, there could be found many more.



REPRODUCIBILITY :
This bug can be reproduced always.

Comments
EVALUATION charset names are not case-sensitive. wait for new round of doc re-work.
20-01-2009