Closed by domoritz 2 months ago
It would be great to include the license of the source files as well.
I think there are at least 2 components that this issue could be split up into. One is converting the SOURCES.md file into something machine readable, like a JSON file or a folder of YAML files. We could adopt a process similar to what "awesome public datasets" ( https://github.com/awesomedata/awesome-public-datasets ) or "campusdata" did in the past: https://github.com/CampusData/campusdata.github.io/blob/master/_data/rankings.yml . In the meantime, there are at least 2 peer projects that can fulfill some of the data exploration use cases for the single-file data requests.
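To make the machine-readable idea a bit more concrete, here is a rough Python sketch that turns a markdown sources file into JSON records. The layout it expects (one `## filename` heading per dataset, followed by free-text notes) is only an assumption, since the real SOURCES.md may be organized differently. The `parse_sources` function, the `SAMPLE` text, and the `license` field are all made up for illustration.

```python
import json
import re


def parse_sources(markdown_text):
    """Split a markdown document into one record per '## ' heading.

    Assumes (hypothetically) that each dataset has its own level-2 heading
    and that the lines under it describe where the data came from.
    """
    entries = []
    current = None
    for line in markdown_text.splitlines():
        heading = re.match(r"^##\s+(.+?)\s*$", line)
        if heading:
            # Start a new record; strip backticks around the file name.
            current = {"file": heading.group(1).strip("`"), "notes": []}
            entries.append(current)
        elif current is not None and line.strip():
            current["notes"].append(line.strip())

    for entry in entries:
        text = " ".join(entry.pop("notes"))
        entry["description"] = text
        # Pull out any links so they can be checked or displayed separately.
        entry["urls"] = re.findall(r"https?://[^\s)>\]]+", text)
        # Placeholder: license information would have to be filled in by hand.
        entry["license"] = None
    return entries


# Illustrative input only, not the real contents of SOURCES.md.
SAMPLE = """\
# Sources

## `world-110m.json`
Possibly from https://github.com/topojson/world-atlas

## `example.csv`
Hypothetical entry with no link.
"""

records = parse_sources(SAMPLE)
print(json.dumps(records, indent=2))
```

A folder of YAML files would carry the same fields. JSON is used here only because it needs nothing outside the standard library.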
world-110m.json looks like it could be from https://www.jsdelivr.com/package/npm/world-atlas?version=1.1.4&path=world (https://github.com/topojson/world-atlas).
Seems like this has fallen by the wayside, but should it ever come back to development, it would be cool if there were some elementary statistics for each of the datasets: how many rows of data, the names of the columns, the types of those columns, etc. Basically the same collection of things that Kaggle lists for lots of the datasets on there.
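For what it's worth, the statistics described above could be gathered with something like the following Python sketch. It only handles CSV, guesses column types with a very naive rule (integer, then number, else string), and uses made-up sample data. The `infer_type` and `summarize_csv` helpers are hypothetical, not part of any existing tooling in this repo.

```python
import csv
import io


def infer_type(values):
    """Guess a column type from its non-empty string values (naive heuristic)."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "empty"
    for name, cast in (("integer", int), ("number", float)):
        try:
            for v in non_empty:
                cast(v)
            return name
        except ValueError:
            continue
    return "string"


def summarize_csv(handle):
    """Return the row count plus column names and guessed types for a CSV file object."""
    reader = csv.DictReader(handle)
    rows = list(reader)
    columns = reader.fieldnames or []
    return {
        "rows": len(rows),
        # Missing cells come back as None from DictReader, so treat them as empty.
        "columns": {c: infer_type([r[c] or "" for r in rows]) for c in columns},
    }


# Made-up sample data, standing in for one of the CSV datasets.
SAMPLE = "name,year,mpg\nchevy,1970,18.0\nbuick,1970,15\n"

summary = summarize_csv(io.StringIO(SAMPLE))
print(summary)
```

The JSON and TopoJSON files would need their own handling, and a real version would probably want proper date detection as well.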