catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License
450 stars 106 forks source link

Update DOI automatically for well-behaved datasets #3639

Open jdangerx opened 2 weeks ago

jdangerx commented 2 weeks ago

Overview

We don't update the DOI reference immediately upon publishing a new archive, because oftentimes using a new version of a dataset causes problems. Instead we wait until we have manually checked all the new differences. This is time-consuming and unnecessary for some datasets.

Success Criteria

How will we know that we're done?

Tasks

What are the next steps to take?

* [ ] write a PR template that includes instructions for PR approval
* [ ] write a GH action that detects new DOIs and makes PRs accordingly
zaneselvans commented 2 weeks ago

I'm not sure EIA-860M belongs in this list (sorry if I'm the one that mentioned it!) since it's where we'll first see new EIA plants that need to be manually mapped to PUDL Plant IDs.

jdangerx commented 2 weeks ago

Yeah, that makes sense! I just copied over your list from the original issue but obviously happy to change that 😄