Mule bugs: misidentification (Latin-1 vs. Chinese), revert issues

Stephen J. Turnbull stephen at xemacs.org
Wed Oct 10 19:35:07 EDT 2007


Michael Sperber writes:

 > I don't even get that far.

What are you trying to do?

 > Is there a brief overview somewhere on how locale influences the
 > various parts of XEmacs?  Specifically, how do file names work?  I
 > have

I forget what is done in 21.4, but in 21.5 `file-name-coding-system'
controls conversion of file names.

 > (get-coding-system-from-locale (current-locale)) => utf-8
 > 
 > Yet XEmacs treats a filename called "äää" (a umlaut, a umlaut, a umlaut)
 > as if it were called "a box a box a box" (i.e. apparently the UTF-8
 > encoding of the file name).

This is a Unicode issue, I'm pretty sure.  I suspect you're on a Mac
with HFS+, and what you see is not (a umlaut, a umlaut, a umlaut) but
rather (a, combining umlaut, a, combining umlaut, a, combining umlaut)
because the Mac puts file names into NFD or NFKD (canonical decomposed
normalization form).  XEmacs doesn't know about the various Unicode
normalizations, and it will take some effort to incorporate them.

Note that it can't possibly be the UTF-8 encoding of the file name
because all accented characters have high-bit set representations in
UTF-8.




More information about the XEmacs-Beta mailing list