meedan / check

Development environment for Meedan Check, a collaborative media annotation platform
https://meedan.com/check
MIT License

Extract fact-checks from Web Data Commons #14

Closed infojunkie closed 1 year ago

infojunkie commented 4 years ago

Tell us about your request

Web Data Commons extracts metadata from Common Crawl. We can extract Schema.org ClaimReview entries from this dataset and populate our Fetch database with such entries.

Implementation details

DGaffney commented 4 years ago

So it sounds like Web Data Commons is essentially an aggregator of Schema.org-compliant content, some subset of which is bound to be ClaimReview objects. In principle, then, adding their ClaimReview objects to the database is a no-brainer. It should not be too difficult to add this to Fetch.
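For illustration only, here is a rough sketch of what the filtering step might look like. It assumes the Web Data Commons dump is read as N-Quads lines and that the fourth element of each quad is the URL of the page the statement came from. The function name `extract_claim_reviews` and the flat output dictionaries are hypothetical and are not part of Fetch; loading the results into the Fetch database is left out.

```python
import re

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Pages use both the http and https forms of the Schema.org namespace.
CLAIM_REVIEW = {"http://schema.org/ClaimReview", "https://schema.org/ClaimReview"}

# One N-Quads statement: subject, predicate, object, optional graph label, final dot.
QUAD = re.compile(
    r'^(?P<s><[^>]*>|_:\S+)\s+'
    r'(?P<p><[^>]*>)\s+'
    r'(?P<o><[^>]*>|_:\S+|"(?:[^"\\]|\\.)*"(?:@[A-Za-z0-9-]+|\^\^<[^>]*>)?)\s*'
    r'(?P<g><[^>]*>|_:\S+)?\s*\.\s*$'
)


def term_value(term):
    """Strip the angle brackets of an IRI or the quotes (and tag) of a literal."""
    if term.startswith("<"):
        return term[1:-1]
    if term.startswith('"'):
        # Escape sequences inside the literal are left as they are.
        return term[1:term.rindex('"')]
    return term  # blank node label, returned unchanged


def extract_claim_reviews(lines):
    """Return one dict per subject typed as ClaimReview (hypothetical helper)."""
    props = {}
    review_keys = set()
    for line in lines:
        match = QUAD.match(line.strip())
        if match is None:
            continue  # skip blank or malformed lines
        # Assumption: blank node labels are only unique within one page,
        # so subjects are keyed by (graph, subject).
        key = (match.group("g") or "", match.group("s"))
        predicate = term_value(match.group("p"))
        value = term_value(match.group("o"))
        if predicate == RDF_TYPE:
            if value in CLAIM_REVIEW:
                review_keys.add(key)
            continue
        # Keep the first value seen for each property, keyed by its short name.
        short_name = predicate.rsplit("/", 1)[-1]
        props.setdefault(key, {}).setdefault(short_name, value)
    return [
        {**props.get(key, {}), "page": term_value(key[0]) if key[0] else None}
        for key in sorted(review_keys)
    ]
```

Nested objects such as `itemReviewed` or `reviewRating` would show up here only as blank node labels; resolving them into full records would need a second pass over the same property table.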