catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License
458 stars 105 forks source link

Make stored metadata more modular and dynamic #523

Closed zaneselvans closed 2 years ago

zaneselvans commented 4 years ago

Right now we have a single large metadata library, stored in src/pudl/package_data/meta/datapkg/datapackage.json. This collection of metadata contains a large amount of duplicated information, making it difficult to update and maintain, while keeping it self-consistent, and in line with the current state of the source code. Several potential improvements:

zaneselvans commented 4 years ago

Having a single source of truth for the metadata will help when we do things like enrich the TableSchemas (See #426) with additional constraints, units, etc -- once we've added that information in the One True Place, we'll know that it will propagate to everywhere that information appears.

cmgosnell commented 2 years ago

subsumed within #806