getting closer

Tuesday, 22 November 2005


        i can now compile and run with unicode-internal and even load up
etc/HELLO, where i duly see "zdravstvuite" in cyrillic and "geia sas" in
greek.  the three versions of the first char after GB/JIS/KSC all report
the same char values, which is a good sign.  the big5 chars appear as ~;
this is because they are encoded using the fake big5-1/big5-2 charsets,
which unicode-internal knows nothing about.  probably i should redo
etc/HELLO to use extended segments to encode big5.  old-mule has
improvements, too; it knows how to handle arbitrary unicode chars (at
least, anything up through 0x31FFF) and allows more charsets than before
-- 96 private in dimension 1 and another 96 private in dimension 2.
more testing is needed before this gets committed, and font-handling in
unicode-internal still needs an overhaul.
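
for reference, a rough sketch of what an extended segment looks like,
assuming the layout from the X Compound Text encoding: ESC % / F M L,
then the encoding name, an STX byte, and the raw data, with M and L
carrying a 14-bit length that covers the name, the STX and the data.
the helper name extended_segment and the charset name "big5-0" are
illustrative assumptions, and whether etc/HELLO would end up using
exactly this layout is also an assumption.

```python
def extended_segment(encoding_name: str, data: bytes,
                     octets_per_char: int = 2) -> bytes:
    """Wrap already-encoded bytes in a Compound Text extended segment.

    octets_per_char: 0 means a variable number of octets per character,
    1 through 4 mean a fixed number.  Hypothetical helper, not XEmacs code.
    """
    if not 0 <= octets_per_char <= 4:
        raise ValueError("octets_per_char must be between 0 and 4")

    # The counted payload is the encoding name, the STX separator, and the data.
    payload = encoding_name.encode("latin-1") + b"\x02" + data
    if len(payload) >= 1 << 14:
        raise ValueError("payload too long for a 14-bit length field")

    # M and L each hold 7 bits of the length, with the high bit always set.
    m = 0x80 | (len(payload) >> 7)
    l = 0x80 | (len(payload) & 0x7F)

    # ESC % / followed by F, where F is 0x30 plus the octets-per-char code.
    return b"\x1b%/" + bytes([0x30 + octets_per_char, m, l]) + payload


# "big5-0" is an assumed encoding name for this example.
big5_bytes = "你好".encode("big5")
seg = extended_segment("big5-0", big5_bytes)
```

a reader of such a stream that does not know the named encoding can
still use the length field to skip over the whole segment.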

ben
