For HTML elements only the attributes name, rel, content and http-equiv are extracted. The attribute property is missing which leads to unpaired, value-only items in the WAT file
property is an RDFa attribute and is not part of the HTML standard. However, it's widely used. The WAT specification describes the data contained in HTML-Metadata as "attributes and values of HTML head elements: title, base, style, link, meta and script". There is no explicit restriction to attributes covered by one of the HTML standards.
(reported by Christian Lund on Common Crawl Google group)
For HTML elements only the attributes
name
,rel
,content
andhttp-equiv
are extracted. The attributeproperty
is missing which leads to unpaired, value-only items in the WAT filee.g, for open graph properties
property
is an RDFa attribute and is not part of the HTML standard. However, it's widely used. The WAT specification describes the data contained inHTML-Metadata
as "attributes and values of HTML head elements: title, base, style, link, meta and script". There is no explicit restriction to attributes covered by one of the HTML standards.