Re: Surrogates and character representation
Alan Watson 24 Jul 2005 17:31 UTC
> FWIW, I now think (after some talk on a private Unicode list) that it's
> correct to allow surrogates as Scheme characters; that is, the range of
> char->integer should be 0 to #x10FFFF.
Hmm. That would seem to prevent an implementation representing strings
internally using UTF-8. This is convenient in some contexts as Scheme
strings can be trivially converted to UTF-8 C strings.
Regards,
Alan
--
Dr Alan Watson
Centro de Radioastronomía y Astrofísica
Universidad Astronómico Nacional de México