Uwe Brauer writes:
- I am not sure whether kile works correctly but there
is no difference in bytes between the 8859-8 and
cp1255 version. Using the command
file hebrew.txt
both files are classified as 8859-8.
That is correct. The bytes *are* the same. The difference between
ISO 8859/8 and CP 1255 is that the latter has more characters in it,
mostly punctuation encoded in positions that are reserved for control
characters by the ISO standards.
- when I open the cp1255 file or 8859-8 files it is
not. Well I thought: this is a font problem. I have a
couple of 8859-8 fonts installed on my Laptop.
It is not possible for a machine to tell the difference between any of
members of the ISO 8859 family or members of the CP 12xx family. They
use exactly the same bytes. In general it is very difficult to
distinguish any unibyte encoding from any other as the repertoires
overlap almost 100%, and they typically share ASCII and a few other
common characters (such as NO BREAK SPACE).
Try C-u C-x C-f hebrew.txt RET cp1255 RET. If that doesn't work, I'll
need more information.
If you want this recognized automatically, you'll need to fiddle with
the coding-priority-list. The easiest way to do this is probably to
restart XEmacs with "LANG=he_IL.UTF-8 xemacs". (You don't want to do
it with he_IL.ISO-8859-8 or worse he_IL.CP1255 because that will
confuse the locale system, and may make it difficult for XEmacs to
recognize UTF-8.)
_______________________________________________
XEmacs-Beta mailing list
XEmacs-Beta(a)xemacs.org
http://lists.xemacs.org/mailman/listinfo/xemacs-beta