Unicode lambda
Lassi Kortela
(12 May 2019 10:19 UTC)
|
Re: Unicode lambda
Shiro Kawai
(12 May 2019 11:18 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 11:40 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 11:50 UTC)
|
Re: Unicode lambda
Shiro Kawai
(12 May 2019 12:06 UTC)
|
Re: Unicode lambda
Marc Nieper-Wißkirchen
(12 May 2019 12:11 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 12:23 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 13:23 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 13:46 UTC)
|
Re: Unicode lambda
John Cowan
(12 May 2019 14:20 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 14:38 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 14:55 UTC)
|
Re: Unicode lambda
John Cowan
(12 May 2019 15:00 UTC)
|
Re: Unicode lambda
Lassi Kortela
(12 May 2019 15:20 UTC)
|
Re: Unicode lambda
Shiro Kawai
(12 May 2019 18:42 UTC)
|
Re: Unicode lambda Lassi Kortela (12 May 2019 19:43 UTC)
|
Re: Unicode lambda
John Cowan
(12 May 2019 22:29 UTC)
|
Re: Unicode lambda
Shiro Kawai
(13 May 2019 10:48 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 08:25 UTC)
|
Re: Unicode lambda
Marc Nieper-Wißkirchen
(14 May 2019 08:50 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 10:10 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 10:59 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 12:35 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 13:09 UTC)
|
Re: Unicode lambda
Lassi Kortela
(14 May 2019 14:04 UTC)
|
Re: Unicode lambda
Shiro Kawai
(14 May 2019 19:18 UTC)
|
Re: Unicode lambda
Vincent Manis
(14 May 2019 22:01 UTC)
|
Re: Unicode lambda
Lassi Kortela
(20 May 2019 09:21 UTC)
|
Re: Unicode lambda
Marc Nieper-Wißkirchen
(21 Oct 2019 14:20 UTC)
|
Re: Unicode lambda
Shiro Kawai
(21 Oct 2019 17:19 UTC)
|
Re: Unicode lambda
John Cowan
(21 Oct 2019 17:39 UTC)
|
Re: Unicode lambda
Marc Nieper-Wißkirchen
(21 Oct 2019 18:43 UTC)
|
Re: Unicode lambda
John Cowan
(21 Oct 2019 23:27 UTC)
|
Encoding declarations
Lassi Kortela
(22 Oct 2019 08:39 UTC)
|
Re: Encoding declarations
John Cowan
(22 Oct 2019 20:52 UTC)
|
#! directives, general and specific
Lassi Kortela
(22 Oct 2019 09:11 UTC)
|
Re: #! directives, general and specific
John Cowan
(22 Oct 2019 20:27 UTC)
|
Re: #! directives, general and specific
Lassi Kortela
(22 Oct 2019 20:43 UTC)
|
Re: Unicode lambda
Marc Nieper-Wißkirchen
(13 May 2019 08:50 UTC)
|
Re: Unicode lambda
Lassi Kortela
(13 May 2019 10:27 UTC)
|
Re: Unicode lambda
Per Bothner
(12 May 2019 14:17 UTC)
|
Re: Unicode lambda
Peter
(12 May 2019 15:06 UTC)
|
> - Probably we want something like #!coding=<value>, rather than just > #!<identifier>. E.g. #!coding=utf-8 instead of #!utf-8. > The latter can eat up #!-namespace quickly. This is why I suggested using ordinary S-expressions. When we start with a simple syntax like #!symbol, eventually we'll want #!symbol=value. Then #!symbol1=value1&symbol2=value2 would be nice. Later we want nested data and end up with a contrived alternative syntax for S-expressions, when we could just have used them from the start :) This is like a syntax version of Zawinski's Law ("Every syntax expands until it can encode nested lists and named fields") or Greenspun’s Tenth Rule ("Any sufficiently complicated custom syntax contains an ad-hoc notation for S-expressions"). Similar comments have been made about network protocols. "I know: I'll encode only the fields I need. That way it will be simple." A decade later, extensions are buggy and hairy. The notoriously complex RFC 822 could have just been an S-expression. I have done this as many times as everyone else. "I know, this time I'll get it _right_! I'll only specify what I need. It will be simple!" After failing every time, I have now mostly learned my lesson, and no longer use any format that doesn't have nested lists and named fields :) An even more insidious thing is that when you have a limited syntax, you don't think about useful new extensions, or subconsciously ignore them because you feel how painful it would be to extend the syntax. So useful tools do not get built for decades. For example, adding nested data to the #!a=b&c=d syntax would probably make sense, but the syntax would look so strained that people subconsciously dismiss any such thought. > One issue of S-expression metadata is that S-expression reader is more > involved than a simple finite automata S-expressions are more code to implement up-front than a custom format (if you already have a regexp library for the latter), but they are more uniform and reliable and the syntax will never get more complex no matter how much it's extended. So you pay the cost only once. > - Recognizing coding[=:]<value> (without #!) can work with the editor. > In Emacs, adding -*- coding: <coding> -*- immediately > switches the buffer encoding automatically. It's true that this is a major advantage of the regexps patterns. In Emacs one can use 'auto-coding-functions' and 'auto-coding-alist' to add detectors. Conceivably a detector could come with the standard scheme-mode. A magic comment can also be shared with other languages. But there is no principled standard syntax to which all/most magic comments conform and it's unlikely that such a standard will emerge. > But I agree that it's getting moot as utf-8 dominates. The only > concern is that, since R7RS-small doesn't require full unicode > support, the R7RS implementation that only supports ascii needs a way to > reject source code using greek-lambda gracefully. The spread of UTF-8 does indeed make our lives easier :) The Unicode lambda could come as a macro from (import (srfi xyz)). Non-Unicode implementations would not support that SRFI so the import would fail before parsing gets to the lambda.