Open westurner opened 9 years ago
https://github.com/scrapinghub/extruct
extruct is a library for extracting embedded metadata from HTML markup.
It also has a built-in HTTP server to test its output as JSON.
Currently, extruct supports:

- W3C's HTML Microdata
- embedded JSON-LD
- Microformat via mf2py
- Facebook's Open Graph
- (experimental) RDFa via rdflib
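To make "embedded metadata" concrete, here is a minimal sketch of what pulling embedded JSON-LD out of HTML can look like, using only the Python standard library. This is not extruct's implementation or API; the class `JsonLdExtractor`, the function `extract_jsonld`, and the sample markup are hypothetical names made up for illustration. See the extruct README for how the library itself is used.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Hypothetical helper: collects <script type="application/ld+json"> blocks."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False  # True while inside a JSON-LD script element
        self._buffer = []        # text chunks of the current script element
        self.items = []          # parsed JSON-LD documents, in page order

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            self.items.append(json.loads("".join(self._buffer)))


def extract_jsonld(html):
    """Return a list of JSON-LD documents embedded in an HTML string."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.items


# Assumed sample page, made up for this example.
sample = (
    '<html><head><script type="application/ld+json">'
    '{"@context": "https://schema.org", "@type": "Person", "name": "Ada"}'
    "</script></head><body></body></html>"
)

if __name__ == "__main__":
    print(json.dumps(extract_jsonld(sample), indent=2))
```

A library like extruct covers the other syntaxes listed above as well (Microdata, Microformats, Open Graph, RDFa), which need considerably more parsing logic than this sketch.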