On Friday 20 September 2002 11:27, you wrote:
>>>>> "Steven" == Steven T Hatton
<hattons(a)speakeasy.net> writes:
Steven> I also want all my files to be opened and saved as UTF-8.
(set-coding-priority-list '(utf-8 no-conversion))
I'm assuming from what you've said before that you REALLY want ALL
files opened as UTF-8. The above means that as far as Mule is
concerned if it's not UTF-8, it's binary. If not,
(set-coding-priority-list '(utf-8))
I want to be sure that any text files I'm working with will be understood by
programs such as xsltproc, and Java apps, these apps correctly understand
them as UTF-8.
Mule will try UTF-8 first, and if that doesn't make sense,
various
ISO-2022, Shift JIS, Big 5, etc.
I believe there may be a way to declare a list of priorities. After I get
some sleep, I'll look into it.
Steven> I don't know the significance of this, but there
is at
Steven> least one character which is UFT-32 which I would like to
Steven> be able to use. Does that mess up everything?
Probably. I suspect Mule-UCS will "corrupt" any character that is not
known to it as a regular Mule character. Emacs does not have a
standard way to represent such characters. AFAIK UTF-32 (by which I
presume you mean 4-octet UTF-8) is not a problem as such (XEmacs can
handle up to 31-bit integers, so full UCS-4 is not OK, but UTF-32 is).
So, the solution is simple, in principle: create a private charset,
map it to fonts, and teach Mule-UCS about it. (I suspect this
solution is also general in the sense that we could create such a
charset on the fly and just stick the characters in there, adding new
ones to the charset and to the Mule-UCS database as we detect them.)
Well we can't steel their font images, but we can draw pictures of them:
http://www.unicode.org/charts/
How do you, uh, "normally" access this character?
Hard copy. It's Latin Extended B 01EA, and 01EB. You can find it on the
above linked site.
(This is the first
time I've actually heard of a case where UTF-32 is needed!) Do you
have fonts and an input method?
There are fonts for for most characters I want to use. I have input methods
for many of them, but these are not necessarily pleasant.
I've found looking at the following files to be a bit instructive:
http://66.92.149.152/open-source/org/xf86/en_US.UTF-8/
http://66.92.149.152/open-source/org/xf86/en_US.UTF-8/Compose
http://66.92.149.152/open-source/org/xf86/en_US.UTF-8/XI18N_OBJS
http://66.92.149.152/open-source/org/xf86/en_US.UTF-8/XLC_LOCALE
They are off my SuSE box. I'm pretty sure they are the prodocut of XFree86.
Mozilla does a pretty good job with these. XEmacs handles many of the fonts,
but it requiered a bit of persuasion.
-- STH