>>>> "Hrvoje" == Hrvoje Niksic <hniksic(a)srce.hr> writes:
Hrvoje> 31-bit integers are nice, but they are hardly that
Hrvoje> important to a normal user.
But when Olivier gets his UCS-4 text manipulation stuff working....
Although extremely few users will want to use UCS-4 for itself, the
least buggy quick path to a wide-char Mule in multilingual contexts
uses UCS-4 in the implementation. UCS-4 characters, by a strange
coincidence, just barely fit into a 31-bit integer.
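
To make the arithmetic concrete, here is a minimal sketch in Python. The
one-tag-bit word layout, the tag value, and the names box_char, unbox_char,
and fits are assumptions for illustration only, not XEmacs's actual object
representation, and the payload is treated as an unsigned bit pattern.

```python
# Hypothetical layout: a 32-bit word with one low tag bit, which leaves
# 31 bits of payload. Illustration only, not the real XEmacs layout.
WORD_BITS = 32
TAG_BITS = 1
PAYLOAD_BITS = WORD_BITS - TAG_BITS  # 31
INT_TAG = 1                          # assumed tag value marking an integer

# UCS-4 is a 31-bit code space: 0 .. 0x7FFFFFFF.
UCS4_MAX = 0x7FFFFFFF


def fits(code, payload_bits):
    """Return True if code fits in payload_bits as an unsigned value."""
    return 0 <= code < (1 << payload_bits)


def box_char(code):
    """Pack a UCS-4 code into a tagged 32-bit word."""
    if not fits(code, PAYLOAD_BITS):
        raise ValueError("code does not fit in the payload")
    return (code << TAG_BITS) | INT_TAG


def unbox_char(word):
    """Recover the UCS-4 code from a tagged word."""
    return word >> TAG_BITS


if __name__ == "__main__":
    word = box_char(UCS4_MAX)
    print(hex(word))                       # the whole 32-bit word is used
    print(unbox_char(word) == UCS4_MAX)    # round trip
    print(fits(UCS4_MAX, 30))              # two tag bits would be one too many
```

With one tag bit the largest UCS-4 value fills the word exactly; with two
tag bits (a 30-bit payload) it no longer fits, which is the "just barely"
above.
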
Unifying the Han ideographs through UCS-2/Unicode is possible, but
will surely introduce new coding-system I/O bugs. The UCS-4 approach
will allow us to change the current low-level buffer implementation
with minimal impact on higher-level Mule code by preserving the
coding-system information at the character level as the current Mule
implementation does.
I think there's a pretty good argument here for making minimal tagbits
the default, at least for Mule builds; maybe not for 21.x, but soon.