JDK-5077424 : 8-bit character support for zip entry names
  • Type: Bug
  • Component: core-libs
  • Sub-Component: java.util.jar
  • Affected Version: 1.4.2
  • Priority: P4
  • Status: Closed
  • Resolution: Duplicate
  • OS: windows_xp
  • CPU: x86
  • Submitted: 2004-07-22
  • Updated: 2006-03-10
  • Resolved: 2006-03-10
Related Reports
Duplicate :  
Name: rmT116609			Date: 07/22/2004

When writing zip entries, it appears that the Unicode characters in their names are truncated to 7 bits, presumably to save storage space. This is a problem when processing file names in languages such as Portuguese, where the c-cedilla character, for example, is converted from \u00E7 to \u0067 (the letter g). I encountered this problem when writing a simple file backup program. I suggest that the platform include support for zip entry names represented by at least 8-bit characters, or by 16-bit characters, to conform with Unicode.

I strongly believe that this change will contribute to the Java language internationalization process.
(Incident Review ID: 285916) 
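
For illustration, the substitution described above (\u00E7 becoming \u0067) is what results from keeping only the low 7 bits of the character. The sketch below shows only that arithmetic. The class and method names are hypothetical, and nothing here claims that the JDK zip classes perform this masking.

```java
public class SevenBitTruncation {

    // Keeps only the low 7 bits of a character (hypothetical helper that
    // mimics the truncation the report describes).
    static char truncateTo7Bits(char c) {
        return (char) (c & 0x7F);
    }

    public static void main(String[] args) {
        char cedilla = '\u00E7';                    // c with cedilla
        char truncated = truncateTo7Bits(cedilla);  // 0xE7 & 0x7F == 0x67
        System.out.printf("U+%04X -> U+%04X (%c)%n",
                (int) cedilla, (int) truncated, truncated);
        // prints "U+00E7 -> U+0067 (g)"
    }
}
```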

EVALUATION
I believe this is a misunderstanding. The names of zip file entries are stored in the zip file in UTF-8, which can lead to problems because some other zip file manipulation tools do not necessarily use UTF-8. Many users create zip file entries with non-ASCII characters. Unless the submitter provides a reproducible test case showing that this is not a misunderstanding, we will likely close this bug soon as Not A Bug.
###@###.### 2004-07-23
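
A reproducible test case along the lines requested could look like the following sketch. It rests on assumptions that are not part of this report: it targets JDK 7 or later (the Charset-taking constructors do not exist in 1.4.2), and the class name, helper names, and entry name are invented for illustration. It writes one entry with a non-ASCII name to an in-memory zip, reads the name back, and checks that the raw archive contains the UTF-8 bytes of the name. The last helper shows how the same bytes look when decoded as ISO-8859-1, as a tool that does not expect UTF-8 might do.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

public class ZipNameRoundTrip {

    // Writes a single empty entry with the given name into an in-memory zip.
    static byte[] writeZip(String entryName) {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (ZipOutputStream zos = new ZipOutputStream(buf, StandardCharsets.UTF_8)) {
            zos.putNextEntry(new ZipEntry(entryName));
            zos.closeEntry();
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return buf.toByteArray();
    }

    // Reads the name of the first entry back, decoding it as UTF-8.
    static String readFirstName(byte[] zip) {
        try (ZipInputStream zis = new ZipInputStream(
                new ByteArrayInputStream(zip), StandardCharsets.UTF_8)) {
            ZipEntry entry = zis.getNextEntry();
            return entry == null ? null : entry.getName();
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
    }

    // Returns true if needle occurs as a contiguous byte run in haystack.
    static boolean contains(byte[] haystack, byte[] needle) {
        outer:
        for (int i = 0; i + needle.length <= haystack.length; i++) {
            for (int j = 0; j < needle.length; j++) {
                if (haystack[i + j] != needle[j]) continue outer;
            }
            return true;
        }
        return false;
    }

    // Decodes the UTF-8 bytes of a name as ISO-8859-1, as a tool that
    // does not expect UTF-8 might.
    static String misdecode(String name) {
        return new String(name.getBytes(StandardCharsets.UTF_8),
                StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String name = "ma\u00E7\u00E3.txt"; // hypothetical Portuguese file name
        byte[] zip = writeZip(name);
        System.out.println("round trip ok: " + name.equals(readFirstName(zip)));
        System.out.println("UTF-8 bytes in archive: "
                + contains(zip, name.getBytes(StandardCharsets.UTF_8)));
        System.out.println("misdecoded length: " + misdecode(name).length()
                + " vs original " + name.length());
    }
}
```

If the round trip succeeds but another tool shows a mangled name, the difference is more likely in how that tool decodes the name bytes than in what was written.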