govdirectory / website

Website repository for Govdirectory - a crowdsourced and fact-checked directory of official governmental online accounts and services.
https://govdirectory.org
Creative Commons Zero v1.0 Universal

Generate and publish full data dumps #24

Open Abbe98 opened 3 years ago

Abbe98 commented 3 years ago

As a data journalist/archivist I will ingest and filter the data in my own tooling.

Task:

Generate data dumps from the Snowman application/sparql-results+json cache by converting it to CSV or another format more widely used than SPARQL result sets.
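A minimal sketch of the conversion step, assuming each cached file is a standard SPARQL 1.1 JSON result set (a `head.vars` list plus `results.bindings`). The function name `sparql_json_to_csv` is hypothetical and not part of Snowman or this repository; how the cache files are located and read is left out on purpose.

```python
import csv
import io
import json


def sparql_json_to_csv(results_json: str) -> str:
    """Convert one SPARQL 1.1 JSON result set into CSV text.

    Hypothetical helper for illustration; not part of Snowman.
    """
    data = json.loads(results_json)
    # "head.vars" lists the selected variables in query order;
    # they become the CSV header row.
    columns = data["head"]["vars"]

    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(columns)

    for binding in data["results"]["bindings"]:
        # An unbound variable is absent from the binding,
        # so it is written as an empty cell.
        writer.writerow(
            [binding.get(column, {}).get("value", "") for column in columns]
        )
    return buffer.getvalue()
```

Note that this keeps only each binding's `value`; datatype and language-tag information present in the JSON is dropped, which may or may not be acceptable for a dump.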

Abbe98 commented 2 years ago

Following support in Snowman for invalidating unused cache items (https://github.com/glaciers-in-archives/snowman/issues/5), we could publish the Snowman cache as our data dump, but we might want to rename the files, as they are currently named by SHA hashes.
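One way the renaming could be planned, as a rough sketch only. It assumes we can somehow build a mapping from each hash-named cache file to the query it came from; that `manifest` is hypothetical and is not something Snowman is known to provide. The helper names `slugify` and `plan_renames` are likewise made up for illustration.

```python
import re


def slugify(name: str) -> str:
    """Lowercase a query name and collapse non-alphanumeric runs into '-'."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def plan_renames(manifest: dict[str, str]) -> dict[str, str]:
    """Map each hash-named file to a readable, unique file name.

    `manifest` (hypothetical) maps hashed file name -> query name.
    Nothing is renamed on disk; the function only returns the plan.
    """
    plan: dict[str, str] = {}
    used: set[str] = set()
    # Sort so the result is deterministic regardless of input order.
    for hashed_name, query_name in sorted(manifest.items()):
        base = slugify(query_name) or "query"
        candidate = f"{base}.json"
        suffix = 2
        # Append a numeric suffix when two queries slugify identically.
        while candidate in used:
            candidate = f"{base}-{suffix}.json"
            suffix += 1
        used.add(candidate)
        plan[hashed_name] = candidate
    return plan
```

Returning a plan instead of renaming directly makes it easy to review the readable names before publishing anything.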

Abbe98 commented 1 year ago

The browser extension currently bundles such a dump.