libwww-perl / HTML-Parser

The HTML-Parser distribution is is a collection of modules that parse and extract information from HTML documents.
Other
6 stars 13 forks source link

HTML 5 [rt.cpan.org #53300] #2

Open oalders opened 4 years ago

oalders commented 4 years ago

Migrated from rt.cpan.org#53300 (status was 'new')

Requestors:

From cfaerber@cpan.org on 2010-01-02 21:51:23 :

HTML::Parser should provide a parsing mode that is fully compliant with HTML 5,
section 9.2 ("Parsing HTML documents",
http://dev.w3.org/html5/spec/Overview.html#parsing).

As this will probably differ significantly from current behaviour, it should be
optional.
castaway commented 2 years ago

HTML::Tagset with v5 support is being worked on: https://github.com/petdance/html-tagset/issues/2

oalders commented 2 years ago

@castaway is there anything else to be done in HTML::Parser once that PR has been merged?

esabol commented 1 year ago

What about the HTML 5 entities? I think there are some new ones, right? Should HTML::Entities support them, perhaps with an option? Or just by default even.

Refer to https://metacpan.org/pod/HTML::HTML5::Entities.

oalders commented 1 year ago

@esabol I think if someone wanted to take that on, that would be great.

castaway commented 1 year ago

@castaway is there anything else to be done in HTML::Parser once that PR has been merged?

Yer asking the hard questions.. I don't think so...