Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing

Show/hide message thread

Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 14:01 UTC)
(missing)
Fwd: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 15:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Jens Axel Søgaard (06 Mar 2019 18:39 UTC)
(missing)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 20:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 21:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:03 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 15:34 UTC)
(missing)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 16:36 UTC)
Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 16:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 17:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:29 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (05 Mar 2019 23:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 19:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (05 Mar 2019 19:59 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (05 Mar 2019 22:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (05 Mar 2019 22:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 06:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Marc Nieper-Wißkirchen (06 Mar 2019 10:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 11:53 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 12:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 14:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Amirouche Boubekki (06 Mar 2019 00:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 12:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 14:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:04 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 14:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 15:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 15:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 16:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 16:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 17:50 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 17:58 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 18:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 18:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 02:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 09:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 10:59 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 13:07 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 14:56 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 16:47 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:20 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 20:29 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:54 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:42 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:08 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (07 Mar 2019 21:31 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:26 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:21 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:10 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:00 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:22 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (06 Mar 2019 18:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 18:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:14 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (06 Mar 2019 19:25 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 19:57 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 20:05 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (06 Mar 2019 21:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (06 Mar 2019 23:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Alex Shinn (07 Mar 2019 20:44 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:15 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 21:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:51 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Per Bothner (06 Mar 2019 17:45 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 00:16 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 09:37 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 03:11 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 10:01 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 20:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 20:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:30 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 21:38 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (07 Mar 2019 22:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (07 Mar 2019 23:17 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:28 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (08 Mar 2019 00:24 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (08 Mar 2019 00:34 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:35 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:19 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:27 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 13:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 13:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (08 Mar 2019 00:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (08 Mar 2019 09:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:36 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 12:46 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 22:18 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing John Cowan (07 Mar 2019 23:33 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 23:40 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (07 Mar 2019 21:02 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (10 Mar 2019 20:49 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 03:06 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Arthur A. Gleckler (11 Mar 2019 03:12 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing elf (11 Mar 2019 10:39 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 10:48 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 11:43 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:09 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Lassi Kortela (11 Mar 2019 14:32 UTC)
Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun (11 Mar 2019 14:36 UTC)

Re: Proposal to add HTML class attributes to SRFIs to aid machine-parsing Ciprian Dorin Craciun 05 Mar 2019 19:45 UTC

A note about HTML (4 or 5) vs XHTML.

My suggestion for XHTML is purely pragmatical:  XHTML is XML, then one
can just use any XML library to parse the document.

Now I know that it "seems" that there are many HTML parsers out there,
unfortunately this is not true...  There are a few, at least for the
most popular programming languages, however they are "bloated" and
full of issues...

I know this because I've tried once to use such tools and tried
"Beautiful Soup" for Python and failed...  Then I've settled on
https://github.com/ericchiang/pup and exported the whole thing as JSON
and moved on from there...

Ciprian.

BTW, the tool I've mentioned `pup` can be used to for HTML meta-data extraction.