"Stephen J. Turnbull" <stephen(a)xemacs.org> writes:
>>>>> "Hrvoje" == Hrvoje Niksic
<hniksic(a)xemacs.org> writes:
Hrvoje> That depends on the outlook. I completely understand if
Hrvoje> es-tzet gets unconditionally assigned to Latin 1, and I
Hrvoje> wouldn't call it wrong.
Bad example, I guess. Are there _no_ characters used in Croation
(or other Latin-2-supported languages) that get assigned to Latin 1
by the "use the first ISO 8859 charset that contains the character"
algorithm?
I don't think so. The Croatian script is simple; in addition to ASCII
it uses "special" characters čćđšž which, with their uppercase
versions, amounts to ten characters, all present in Latin 2. Sorting
is a bit more work because it requires correct collation of those
characters and possibly support for digraphs dž, lj, and nj (which
have Unicode code points assigned that no one uses), but that's not an
immediate concern for a text editor, and can be kludged in later by
locally written applications that need that kind of thing.
Note that XEmacs is pretty close to meeting Croatian users' needs in
this; GNU Emacs doesn't recognize the `scaron' etc. keysyms *at all*.