RE: String comparison under Latin-1 and Unicode

Show/hide message thread

RE: String comparison under Latin-1 and Unicode Ben Goetter (10 Mar 2000 20:47 UTC)

Re: String comparison under Latin-1 and Unicode Sergei Egorov (10 Mar 2000 21:41 UTC)

RE: String comparison under Latin-1 and Unicode Ben Goetter 10 Mar 2000 20:46 UTC

> it seems to me that STRING<? and
> others are better left for trivial tasks like sorting strings of digits;

Very few such trivial tasks exist in the application domain.

> for example, if you want to implement string sets as
> sorted lists, it's much better to use fast ordering predicate,

I argue that this is the only time that a language-insensitive ordering
predicate is useful: implementing ADTs on top of strings.  And for that,
the character-ordering predicates remain.  (Indeed, the character-ordering
predicates,
much like the character-wise casemapping procedures, seem useful for little
else.)

> especially because collation is actually a complex process
> involving generation of "collation keys" which can be reused:

Agreed.  But for single-pass cases where you don't want to cache and reuse
sort keys, I would like the string preds to do the right thing, which, I
claim,
is language-sensitive collation.

So it's back to the string procs vs text procs thread of a couple of months
ago.

> the other hand, some Schemes have already implemented
> extended versions of these predicates accepting more than
> two arguments to make them similar to < and others

I see.  Might foul the (APPLY STRING<? some-set-of-strings) case
if it had to check the type of its last argument.

I also want (WITH-LANGUAGE ...) to solve the Turkish casemapping issue.
Separate point.