>>>> "Hrvoje" == Hrvoje Niksic
<hniksic(a)xemacs.org> writes:
Hrvoje> That depends on the outlook. I completely understand if
Hrvoje> es-tzet gets unconditionally assigned to Latin 1, and I
Hrvoje> wouldn't call it wrong.
Bad example, I guess. Are there _no_ characters used in Croation (or
other Latin-2-supported languages) that get assigned to Latin 1 by the
"use the first ISO 8859 charset that contains the character"
algorithm? For the extreme case consider Latin 9, where everything,
except the Euro, gets assigned to another Latin character set.
Hrvoje> On the other hand, Croatian XEmacs users, including
Hrvoje> myself, *do* expect š and ž (which they have on their
Hrvoje> keyboards and use daily) to be part of Latin 2, so that
Hrvoje> they can safely enter all Croatian chars and save the
Hrvoje> buffer as Latin 2.
*sigh* It's such a shame the Mule designers had to be Japanese. ISO
8859 very clearly[1] thinks of the various "parts" (ie, Latin 1, Latin
2, ISO Greek, etc) as _subsets_, and you should be able to save a
buffer in any of the subsets which contains all the buffer characters.
Only Japanese would have such a strong bias toward implementing the
ISO 8859 character sets as disjoint.
We really, really need to go to Unicode internally to force this mess
to change.
Footnotes:
[1] In hindsight. I realized this thanks to your rants.
--
School of Systems and Information Engineering
http://turnbull.sk.tsukuba.ac.jp
University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
Ask not how you can "do" free software business;
ask what your business can "do for" free software.