Email list hosting service & mailing list manager

case mappings Alex Shinn (13 Jul 2005 03:57 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 05:49 UTC)
Re: case mappings Michael Sperber (13 Jul 2005 06:41 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 06:47 UTC)
Re: case mappings Michael Sperber (13 Jul 2005 07:12 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 07:21 UTC)
Re: case mappings bear (13 Jul 2005 17:24 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 19:35 UTC)
Re: case mappings Alex Shinn (13 Jul 2005 07:55 UTC)
Re: case mappings Alex Shinn (13 Jul 2005 07:40 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 19:36 UTC)
Re: case mappings Alex Shinn (14 Jul 2005 02:39 UTC)
Re: case mappings Thomas Bushnell BSG (14 Jul 2005 07:15 UTC)
Re: case mappings Alex Shinn (14 Jul 2005 07:42 UTC)
Re: case mappings Thomas Bushnell BSG (14 Jul 2005 08:07 UTC)
Re: case mappings Alex Shinn (14 Jul 2005 08:24 UTC)
Re: case mappings bear (14 Jul 2005 16:47 UTC)
Re: case mappings Thomas Bushnell BSG (14 Jul 2005 20:29 UTC)
Re: case mappings bear (15 Jul 2005 18:23 UTC)
Re: case mappings Thomas Bushnell BSG (15 Jul 2005 19:52 UTC)
Re: case mappings Matthew Flatt (13 Jul 2005 13:05 UTC)
Re: case mappings Thomas Bushnell BSG (13 Jul 2005 19:39 UTC)
Re: case mappings Alex Shinn (14 Jul 2005 02:31 UTC)

Re: case mappings bear 14 Jul 2005 16:47 UTC


On Thu, 14 Jul 2005, Thomas Bushnell BSG wrote:

> From my perspective, the problem with the current standard is
> that you *cannot* implement Unicode properly.  Far from
> requiring it, it is essentially prohibited.

> I fear that this SRFI is (ironically) in danger of doing the
> same damn thing all over again.

> If Scheme standardizers would just get out of the way, and
> allow Unicode-interested Schemes to implement Unicode
> correctly, I would be happy.  My concern is that I want to make
> sure that in the next go-round of the RnRS, it is possible to
> write a Unicode conformant Scheme system.

I think this is my essential position too.  Requiring a specific
form of unicode support (or requiring it at all) may be
premature; but at the very least removing requirements that
militate *AGAINST* proper unicode support must be done.

Making symbols case-sensitive and not requiring case operations
that work on single characters, I believe, is enough to make it
possible for interested implementors to create fully unicode
compliant schemata.

The standard may also require case operations that work on
strings without breaking the ability of an implementor to be
fully compliant with both RnRS and Unicode.  But this is
optional, and can be relegated to a library function.

Many implementors may choose to also have case operations that
work (non-compliantly) on single characters, but IMO the standard
must not require them.

This is not, however, enough to make it possible for these
schemata to handle unicode with any kind of uniform semantics.
For example, for a given external representation, I can imagine
different systems thinking that it was any of six different
lengths, depending on whether they do normaliztion, what choice
of normalization form they use, and whether they map characters
to codepoints or grapheme clusters (making normalization moot).

A standard could iron out these issues; but in the absence of
experience and the reports of satisfied or unsatisfied users, it
may actually be better if the standard does not.

				Bear