Mule bugs: misidentification (Latin-1 vs. Chinese), revert issues

Aidan Kehoe kehoea at parhasard.net
Wed Jan 16 14:46:58 EST 2008


 Ar an séú lá déag de mí Eanair, scríobh Glynn Clements: 

 > Stephen J. Turnbull wrote:
 > 
 > >  > And on OS X file-name-coding-system (and relatedly, the 'file-name
 > >  > coding system alias) is unconditionally UTF-8, independent of the
 > >  > locale coding system.
 > > 
 > > This is a good heuristic, since the most popular file system on OS X
 > > is HFS+, which does try to enforce UTF-8 file names.
 > 
 > Although, OSX encodes accented characters using combining accents
 > rather than using pre-composed characters. IIRC, OS functions will
 > decompose any pre-composed characters automatically, but you need to
 > bear in mind that a byte string obtained from e.g. readdir() won't
 > necessarily match the byte string you passed when creating the file.

Yeah; that’s something we don’t bear in mind at the moment, unfortunately.

 > Needless to say, Unix/Linux just treats composed/decomposed versions
 > of the same text as entirely different filenames.

Yes! Of course /tmp/ä and /tmp/ä are distinct files! Perhaps a smidgin more
surprising is that NTFS does too, to my knowledge.

-- 
¿Dónde estará ahora mi sobrino Yoghurtu Nghé, que tuvo que huir
precipitadamente de la aldea por culpa de la escasez de rinocerontes?




More information about the XEmacs-Beta mailing list