Al Petrofsky <xxxxxx@petrofsky.org> schrieb am Fr., 21. Juli 2017 um 01:06 Uhr:
On Thu, Jul 20, 2017 at 5:53 AM, Marc Nieper-Wißkirchen <xxxxxx@nieper-wisskirchen.de> wrote
So the question is: What does "appear in" means, e.g. whether it uses free-identifier=? or bound-identifier=? as the equality predicate of the set(oid) of <literal>s
R7RS further says: "An identifier appearing within a <pattern> can be an underscore (_), a literal identifier listed in the list of <literal>s, or the <ellipsis>. All other identifiers appearing within a <pattern> are pattern variables."

Again, "appear in" is used. And here it is clear that is must be "free-identifier=?", otherwise the underscore or the ellipsis would not be matched by their denotation. (Or "appear in" would suddenly change its meaning in the very same sentence, which is absurd.)

I would say that "X appears in Y" simply refers to the S-expression concept of X being some descendant element of Y. R7RS 4.3.2 also says "It is an error for the same pattern variable to appear more than once in a <pattern>. "  I think you agree that determining whether pattern variables are the "same" should be done using bound-identifier=?, so I think that kills your theory that the verb "appear" can only be used in connection with free-identifier=? comparisons.

I noticed this as well, but this "appear" is a predicate of another sentence, so a bit less "absurd". But I agree that this makes my argument less convincing.
 
So it appears that the question is not what "appears" means, but rather what the meaning of "be" be.  What does it mean for "an identifier appearing within a <pattern>" to "be an underscore" or "be ... a literal identifier listed in the list of <literal>s"?

That's a nice way out of my argumentation (whch would have led to less fortunate semantics of the R7RS)!
 
R7RS 4.3.2 also requires us to identify "Identi fiers that appear in the template but [be] not pattern variables".

I would say that the rule of being that's most in accord with the concept of "lexical scope" is that when you're trying to determine whether an identifier's meaning has been changed by a declaration that's lexically present in the syntax-rules form (i.e., checking whether an identifier in a template has been declared a pattern variable by some identifier in the pattern, or checking whether an identifier in a pattern has been declared a literal by some identifier in the literals list), you use bound-identifier=?.  And when you're checking whether some free identifier (i.e., one not affected by any of the declarations in the syntax-rules form) should have some special meaning because (1) it's an identifier like _ or ... that was declared to be special in the definition of syntax-rules in the top-level environment, and (2) the environment of the syntax-rules form does not contain any shadowing binding of the identifier, then you use free-identifier=? to do the comparison.

Irrespectively of my former reading of the R7RS (which I outlined in my previous post), I also do think that using bound-identifier=? makes (much) more sense when testing for literals in the pattern variables.

There is three more arguments in favour of using bound-identifier=?. Consider

(define-syntax foo
  (syntax-rules ... (...)
    ((foo ...) 'foo)))

In the only rule, Is ... the ellipsis or the literal ...?

SRFI 46, which was voted into the R7RS as is, says: "This [the ellipsis] identifier's specification is considered to be a binding whose scope is the rules of the transformer. The macro system implementation must make the hygienic arrangements described in R5RS's section 4.3 to preserve the lexical scope of these bindings."

So if an implementation used free-identifier=?, "..." would not be detected as a literal; if an implementation used bound-identifier=?, it would be. And the latter is much less confusing to the user.

In particular, in the former case, the following would behave differently:

(define-syntax foo
  (syntax-rules ... (...)
    ((foo ...) 'foo)))

The second reason why bound-identifier=? is likely be what was intended lies again in the quotation from SRFI 46 from above. For specifying the ellipsis, the lexical scope is mentioned explicitly. It would be confusing if literals were not playing by the same rules.

The last reason is based on history, or rather backward-compatibility. All the R5RS and R6RS Schemes I tested use bound-identifier=?. The two sample implementations (by Al Petrofsky) cited in SRFI 46 (which is the basis of the R7RS macro system) also use bound-identifier=?.

As no intention that R7RS deliberately wanted to break compatibility with existing Schemes and standards with respect to detecting literals amongst the pattern identifiers was made public, one may conclude that the R7RS didn't want to change the existing behaviour, so bound-identifier=? is the correct predicate.

So what I will do now (or quite soon when time permits) is to rewrite the sample implementations of SRFI 147 and SRFI 148 so that they work with the better, more hygienic matching.

We should also ask Alex Shinn to bring his implementation in line with the other (and previous) implementation. There is a simple change necessary here:

https://github.com/ashinn/chibi-scheme/blob/master/lib/init-7.scm#L731

Just change "compare" into "eq?", which is bound-identifier=? in the syntactic closures implementation, Chibi is using. (I made the change locally, and all tests continue to pass.)

With that change, we can hopefully expect that Sagittarius will follow, so at least the R7RS systems Chibi, Sagittarius, and Gauche will agree with previously implementations of R5RS/R6RS/SRFI 46 and which the obviously better semantics.

Marc