RE: CHAR-{UP,DOWN}CASE & super-ASCII encodings Ben Goetter 25 Jan 2000 19:43 UTC

> I think the sleazes I specify for patching CHAR-{UP,DOWN}CASE are
> acceptable,

I agree.  I meant, after subsuming all of your sleazen.

> Heh? What about Latin-1's es-szet? Decomposing turns it into "ss" for
> sure?
> Or leaves it as es-szet?

As I read the Unicode specification, sharp-s (U+00DF) does not mechanically decompose
into ss.  I was conflating the recent German spelling reform with
decomposition.  My mistake.
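
For anyone who wants to check this against a Unicode database, here is a minimal sketch. It assumes Python 3 and its standard `unicodedata` module, which are not part of this discussion; the variable names are mine. It asks for the decomposition mapping of sharp-s and normalizes it to the decomposed forms, with a-umlaut shown for contrast.

```python
import unicodedata

SHARP_S = "\u00df"   # LATIN SMALL LETTER SHARP S
A_UMLAUT = "\u00e4"  # LATIN SMALL LETTER A WITH DIAERESIS

# Decomposition mapping from the Unicode Character Database.
# An empty string means the character has no decomposition at all.
sharp_s_mapping = unicodedata.decomposition(SHARP_S)
a_umlaut_mapping = unicodedata.decomposition(A_UMLAUT)

# Normalizing to the decomposed forms leaves sharp-s as it was.
sharp_s_nfd = unicodedata.normalize("NFD", SHARP_S)
sharp_s_nfkd = unicodedata.normalize("NFKD", SHARP_S)

# By contrast, "SS" does appear under Python's string-level upcasing,
# which is a case mapping rather than a decomposition.
sharp_s_upper = SHARP_S.upper()

print(repr(sharp_s_mapping), repr(a_umlaut_mapping))
print(sharp_s_nfd == SHARP_S, sharp_s_nfkd == SHARP_S, sharp_s_upper)
```

So "ss" enters the picture only through upcasing a whole string, which is a separate question from decomposition.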

> What is done with downcasing Unicode's position-sensitive Greek
> letters?

Lowercase sigma is the only such Greek letter.  Follow the official
Unicode data-table, which dictates that U+03A3 maps to U+03C3, i.e.,
non-final sigma.  That is correct at the character level, though not at
the string or text-block level.  Initial/medial-sigma and final-sigma
are the same letter, just written differently.  (This should be a
shaping issue, like writing Arabic, not a discrete character value.  But
it isn't.)
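
To illustrate the character-level versus string-level difference, here is a rough sketch, again assuming Python 3 (my choice of illustration, not something this proposal depends on). Downcasing one character at a time follows the simple table mapping, while Python's whole-string `str.lower()` additionally applies the final-sigma rule. The sample word and the variable names are mine.

```python
CAPITAL_SIGMA = "\u03a3"  # GREEK CAPITAL LETTER SIGMA
SMALL_SIGMA = "\u03c3"    # GREEK SMALL LETTER SIGMA (non-final)
FINAL_SIGMA = "\u03c2"    # GREEK SMALL LETTER FINAL SIGMA

# "ODYSSEUS" in Greek capitals; it ends in a capital sigma.
word = "\u039f\u0394\u03a5\u03a3\u03a3\u0395\u03a5\u03a3"

# Character level: each character is downcased in isolation, so every
# capital sigma becomes the non-final form, as the data table dictates.
per_char = "".join(ch.lower() for ch in word)

# String level: the whole word is downcased at once, so the
# implementation can see that the last sigma is word-final.
whole = word.lower()

print(per_char)
print(whole)
```

The two results differ only in the last character: the per-character version ends in non-final sigma, the whole-string version in final sigma.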