Just as an FYI on the UCS-4 private use space:
On 2000-03-07, the Unicode Technical Committee submitted document "N2175" to
ISO/IEC JTC1/SC2/WG2 titled:
"Proposal to restrict the range of code positions to the values up to
U-0010FFFF"
Summary:
The Unicode Consortium requests a remedy to this situation: the publication
of a technical corrigendum to ISO/IEC 10646-1:2000 which excludes values
above U-0010FFFF. In this corrigendum,
* The private use characters from U-60000000 to U-7F000000 and from
U-00E00000 to U-00FFFFFF would be removed from the standard.
* A note would be added stating that for interoperability between UTF-8,
UTF-16 and UCS-4, it is not expected that any code positions will ever be
allocated above U-0010FFFF.
URL:
http://anubis.dkuug.dk/jtc1/sc2/wg2/docs/n2175.htm
On 2000-03-24 WG2 accepted the proposal:
(from the minutes of the meeting:
http://anubis.dkuug.dk/jtc1/sc2/wg2/docs/n2204.doc)
WG2 accepts the proposal in document N2175 towards removing the provision
for Private Use Groups and Planes beyond Plane 16 in ISO/IEC 10646, to
ensure internal consistency in the standard between UCS-4, UTF-8 and UTF-16
encoding formats, and instructs its project editor to prepare suitable text for
processing as a future Technical Corrigendum or an Amendment to
10646-1:2000.
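
For anyone wondering where the U-0010FFFF limit comes from: it is the highest
code position a UTF-16 surrogate pair can reach, so anything above it cannot
round-trip between UCS-4 and UTF-16. A rough sketch of the arithmetic follows.
The function names are my own and purely illustrative; only the surrogate
ranges (D800-DBFF and DC00-DFFF) come from the UTF-16 definition.

```python
# Illustrative sketch only: helper names are hypothetical, not from any standard.

HIGH_SURROGATE_START = 0xD800  # first high (leading) surrogate
LOW_SURROGATE_START = 0xDC00   # first low (trailing) surrogate
SURROGATES_PER_HALF = 0x400    # 1024 high and 1024 low surrogate values


def decode_surrogate_pair(high: int, low: int) -> int:
    """Combine a UTF-16 high/low surrogate pair into a code position."""
    return (
        0x10000
        + ((high - HIGH_SURROGATE_START) << 10)
        + (low - LOW_SURROGATE_START)
    )


def utf16_max_code_position() -> int:
    """Largest code position addressable with one surrogate pair."""
    last_high = HIGH_SURROGATE_START + SURROGATES_PER_HALF - 1  # 0xDBFF
    last_low = LOW_SURROGATE_START + SURROGATES_PER_HALF - 1    # 0xDFFF
    return decode_surrogate_pair(last_high, last_low)


def within_utf16_range(code_position: int) -> bool:
    """True if the code position does not exceed what UTF-16 can address."""
    return 0 <= code_position <= utf16_max_code_position()


if __name__ == "__main__":
    print(f"UTF-16 upper limit: U-{utf16_max_code_position():08X}")
    # Start of each private use range named in the N2175 summary.
    for start in (0x00E00000, 0x60000000):
        print(f"U-{start:08X} reachable from UTF-16: {within_utf16_range(start)}")
```

Both private use ranges named in the proposal start above that limit, which is
why they could never be exchanged through UTF-16.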
FYI,
Bill