In an early stage of the QRank pipeline, we compute a mapping from Wikimedia sitelinks to Wikidata entity IDs. This mapping seems generally useful. For example, OpenStreetMap quality assurance tools could use this for checking the consistency of wikipedia and wikidata tags on geographic features.
[ ] re-implement the sitelinks extraction in Rust for better performance / lower memory consumption
[ ] set up a cronjob to re-generate the sitelinks file once per week
[ ] document the format and schema of the sitelinks file
In an early stage of the QRank pipeline, we compute a mapping from Wikimedia sitelinks to Wikidata entity IDs. This mapping seems generally useful. For example, OpenStreetMap quality assurance tools could use this for checking the consistency of
wikipedia
andwikidata
tags on geographic features.