Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing

Show/hide message thread

Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 14:01 UTC)
(missing)
Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 17:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:29 UTC)
(missing)
Fwd: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 15:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 18:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 15:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 16:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (05 Mar 2019 23:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:59 UTC)
(missing)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 20:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 21:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 06:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Marc Nieper-Wißkirchen (06 Mar 2019 10:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 11:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:04 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 16:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 17:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 17:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 18:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 02:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 09:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 10:59 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 13:07 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:56 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 16:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:20 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 20:29 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:42 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:08 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:31 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:21 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 19:25 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:57 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 20:05 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 03:11 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 10:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:30 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 22:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 23:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:28 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (08 Mar 2019 00:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (08 Mar 2019 00:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:27 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (08 Mar 2019 00:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:40 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (10 Mar 2019 20:49 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (11 Mar 2019 03:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 10:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 10:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 11:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:09 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 14:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 23:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (07 Mar 2019 20:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 21:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 17:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:16 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:37 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 12:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 14:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:03 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (06 Mar 2019 00:18 UTC)

Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun 07 Mar 2019 21:05 UTC

On Thu, Mar 7, 2019 at 10:46 PM Lassi Kortela <xxxxxx@lassi.io> wrote:
> This is great, but how would you integrate this into the current SRFI
> process? The change in HTML structure would be very large. Would you
> mandate this for new SRFIs, have editors convert submitted SRFIs to this
> format by hand, or try to write an automated conversion tool?

As hinted in a previous email, I think we should approach these SRFI
documents as an actual "magazine", in which the authors submit their
"papers", then when a final version is reached, the "publisher" takes
over and re-formats the actual text.  Moreover once this process is
over, no re-doing is needed, as the actual SRFI documents are
read-only.

Unfortunately (after looking at SRFI-1) I don't think this
restructuring can be done automatically...  For example I've found
`<code>` inside `<code>` or `<pre>`, multiple identifiers contained in
the same `<code>` element, invalid closed elements, etc.  Moreover the
current HTML structure is mainly focused on presentation, thus some
`<code>` elements contain `&nbsp;` for tabulation, etc.

(I don't want to criticize the authors / editors, an unfortunately
HTML is hard to edit correctly.  I just state the current status.)

Therefore, given that there aren't many "final" SRFI's (only 128), I
think such a process, although lengthy, would yield the best quality
documents.

> > Because I still maintain my position that trying to extract more than
> > basic metadata about the procedures described within, I'll simplify
> > and remove extra classes or elements (if they exist).
>
> I'm not sure which metadata you consider basic.
>
> The tool I wrote relies only on plain text (in S-expression syntax) to
> extract the names of arguments and return values, including optional /
> rest arguments. It doesn't rely on HTML tags or classes for anything at
> all. So whatever HTML structure you end up with, as long as you can pull
> the plain text of a definition, its arguments can be extracted.

As stated in previous emails, although I agree that "something" is
better than "nothing", and given the fact that you've managed to pull
this is extraordinary, in the end the extracted information is not
"complete" nor "reliable"...

My "end-goal" (but long-term) is to have for Scheme something similar
to Erlang's `dyalizer`:  https://learnyousomeerlang.com/dialyzer

(And for such a goal, "unreliable" signatures are almost useless...)

Ciprian.