Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing

Show/hide message thread

Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 14:01 UTC)
(missing)
(missing)
Fwd: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 15:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 18:39 UTC)
(missing)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 20:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 21:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 06:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Marc Nieper-Wißkirchen (06 Mar 2019 10:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 11:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 12:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 14:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:03 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 15:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 16:36 UTC)
Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 17:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:29 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (05 Mar 2019 23:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:59 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (06 Mar 2019 00:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:04 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 16:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 17:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 17:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 18:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 02:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 09:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 10:59 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 13:07 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:56 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 16:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:20 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 20:29 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:42 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:08 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:31 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:21 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 19:25 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:57 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 20:05 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 03:11 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 10:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:30 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 22:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 23:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:28 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (08 Mar 2019 00:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (08 Mar 2019 00:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:27 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (08 Mar 2019 00:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:40 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (10 Mar 2019 20:49 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (11 Mar 2019 03:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 10:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 10:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 11:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:09 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 14:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 23:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (07 Mar 2019 20:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 21:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 17:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:16 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:37 UTC)

Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun 06 Mar 2019 12:53 UTC

On Wed, Mar 6, 2019 at 1:53 PM Lassi Kortela <xxxxxx@lassi.io> wrote:
> What does everyone you think? I would prefer to try this first because
> it's an established standard with readily available tools. All the
> while being simple and familiar (they just use HTML with light
> additions, much like we've been doing so far, and specify a standard
> conversion formula to JSON). This would also settle our debate about
> how to use class attributes, since they specify that :-P
>
> For example, here's how one might mark up a procedure definition:
>
>      <p class="h-proc-def">
>        <b>Procedure: </b>
>        <code>
>          <a class="p-name" name="make-array">make-array</a>
>          <var class="h-arg">interval</var>
>          <var class="h-arg">getter</var>
>          [ <var class="h-arg"><span
> class="p-type">optional</span>setter</var> ]
>        </code>
>      </p>

(Without upsetting anyone) I really think that this is the best
example on how to fail at this endeavor.

It is so complex:

* in order to identify that `make-array` is actually a procedure
definition, we have to look for an element that has the class
`h-proc-def`, which should contain (somewhere not necessarily in
direct children) an element with the class `p-name` whose attribute is
the actual "name" of the procedure;  just trying to think about
expressing this in code, especially with XML libraries or XSLT scares
me...  (for a second try just try to imagine how the code to extract
arguments looks like;)

* it provides too much overhead:  it has too much duplication, the
`make-array` token appears twice;

* it fails to capture all signature elements:  what is the output of
the procedure?  what are the types of various arguments?

When designing this format think about how one could use `pup` / `jq`
to extract the data.

My proposal is to keep things simple:

* for indexing just using `<a class="proc-def">make-array</a>` is enough;

* for actual signatures I think an S-expression based description is
better (however see the other paragraph where I note that perhaps this
is too much for SRFI's);  for example:

  https://github.com/volution/vonuvoli-scheme/blob/development/documentation/libraries-r7rs.ss#L6373

    (make-vector
        (type constructor)
        (export scheme:base)
        (signature
            ((range-length-zero) -> vector-empty)
            ((range-length-zero any) -> vector-empty)
            ((range-length-not-zero) -> vector-not-empty)
            ((range-length-not-zero any) -> vector-not-empty))
        ...

I've tried hard to think about this problem (when I did my R7RS
documentation conversion) and came to the conclusion that one can't
expect to extract accurate information from "text" documents without
making a mess out of them.  Then I came to the conclusion that just
"back-referencing" things in the actual text, and providing "external"
structured syntax / signatures is the best approach.

Take the example of `cond`:

  https://github.com/volution/vonuvoli-scheme/blob/development/documentation/libraries-r7rs.ss#L2932

I start with the structured syntax signature, and then in the
description, formatted as CommonMark I could use CommonMark references
to "special" tags to link to other elements.  (I haven't included one
in `cond`'s description, but I have technical support for it in the
parser / formatter.)

However this is perhaps too-much for the SRFI use-case.  Instead I
think just having a few "markers" to allow indexing /
back-referencing, then a simplified / standard structure (sections,
paragraphs, lists, code snippets, etc.) is enough.  Based on this one
can take the (X)HTML and "render" it as CommonMark / other formats to
be included in his own documentation.

>  > I'm still not convinced of the need to go all the way to XHTML.
>
>  > If there's a reliable way to convert from HTML to XHTML, then
>  > there's no need for XHTML to be the on-disk format.
>
> XHTML is definitely not necessary for indexing, but it may be
> necessary/much easier for the full-text conversions Ciprian would like
> to do. I guess the decision rests on how easy it is to automate?

I agree that XHTML is not strictly necessary, but as highlighted above
it would help us from a technical point of view, especially to convert
it to other formats to be included in other documentations.

Ciprian.