owid / etl

A compute graph for loading and transforming OWID's data
https://docs.owid.io/projects/etl
MIT License
79 stars 21 forks source link

Try YAML based explorers #3333

Open Marigold opened 2 weeks ago

Marigold commented 2 weeks ago

Motivation

Our metadata is gradually converging into YAML files within the ETL pipeline, including dataset metadata, multi-dimensional indicators, and TSV files created by create_explorer (though technically stored in a config dictionary). YAML offers several benefits:

It would be an interesting experiment to convert some TSV files into YAML and assess their user-friendliness.

Solution

Design a YAML schema similar to the one used for multi-dimensional indicators (mdims) and attempt to generate a TSV file from it. What are the differences between the mdims and explorer YAML files? If successful, this could be a first step toward unifying mdims and explorers under a YAML-based format.

Open Questions

larsyencken commented 14 hours ago

Totally agree that YAML is the better format. Perhaps the ideal starting point is a TSV blob in MySQL, which we could then try moving to/from some nicer YAML format.

Researchers still occasionally update explorers in the admin, but we believe it's less often that they create them now.