|
SRFI 207: String-notated bytevectors
Arthur A. Gleckler
(15 Aug 2020 23:29 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Per Bothner
(16 Aug 2020 00:31 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Alex Shinn
(16 Aug 2020 01:16 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(16 Aug 2020 10:15 UTC)
|
||
|
(missing)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(16 Aug 2020 10:40 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
John Cowan
(17 Aug 2020 03:18 UTC)
|
||
|
bytestring procedure
Lassi Kortela
(17 Aug 2020 07:56 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(17 Aug 2020 16:10 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Shiro Kawai
(18 Aug 2020 00:19 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors Lassi Kortela (18 Aug 2020 06:51 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 07:04 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(18 Aug 2020 09:53 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 10:14 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Shiro Kawai
(18 Aug 2020 10:50 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(18 Aug 2020 10:57 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 11:22 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
John Cowan
(18 Aug 2020 15:49 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 16:12 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(18 Aug 2020 16:38 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 17:00 UTC)
|
||
|
(missing)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 18:49 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
John Cowan
(18 Aug 2020 22:30 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Shiro Kawai
(19 Aug 2020 20:38 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(19 Aug 2020 20:44 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
John Cowan
(19 Aug 2020 21:55 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Shiro Kawai
(20 Aug 2020 00:54 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(20 Aug 2020 06:04 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Shiro Kawai
(20 Aug 2020 06:09 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(20 Aug 2020 06:33 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 17:43 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
John Cowan
(18 Aug 2020 17:49 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(18 Aug 2020 18:31 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 16:16 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(18 Aug 2020 09:48 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Marc Nieper-Wißkirchen
(18 Aug 2020 10:02 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(18 Aug 2020 10:27 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(18 Aug 2020 10:28 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Daphne Preston-Kendal
(16 Aug 2020 10:31 UTC)
|
||
|
Re: SRFI 207: String-notated bytevectors
Lassi Kortela
(16 Aug 2020 10:10 UTC)
|
||
> I think the main reason is that if we allowed the full Unicode range
> and specified UTF-8 encoding, sequences like \x80; would be ambiguous.
> Either the byte 80 is meant or the bytes corresponding to the UTF-8
> encoding of U+0080.
>
> I agree. With this regard, I think the syntax Alex suggested earlier
> seems to work well. This way, octed sequence that's not valid as utf-8
> can be included without ambiguity:
>
> #u8("abcde" #x80 "efghi")
>
> In the string part, we can say either ASCII-only, or utf-8 encoded string.
That syntax also had the advantage that the R7RS #u8(...) vs R6RS
#vu8(...) discrepancy is dodged. Implementations would allow strings
inside whichever # prefix they currently have. In case they support both
prefixes, allow it in both.
> In order to allow implementations to extend #u8"..." so that "..." can
> be any string allowed by the implementation (*), I want to suggest to
> rename the sequence "\xHH;" of this SRFI into something different like
> "\yHH;". I think this is a good thing because "\xHH;" in strings and
> characters really means something different for bytes greater than
> #x7F as soon as we encode it in UTF-8 (which string->utf8 does).
I'd vote to support fewer kinds of characters in bytestring literals to
avoid confusion. Here the #u8("foo") syntax would have the advantage
that we'd have a good excuse to drop \x; escapes completely :) The
bytevector #u8("Hello" #x20 "world" #x0a) would be equivalent to
u8"Hello\x20;world\x0a;" in the current draft without using any escapes.
If \xHH; is not supported, confusion about its meaning and permitted
range of characters is also avoided.