catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License
456 stars 106 forks source link

Add dagster to documentation #1872

Closed bendnorman closed 1 year ago

bendnorman commented 1 year ago

Dagster introduces a lot of new concepts and code to PUDL. Make sure we continue the tradition of super rad documentation.

jdangerx commented 1 year ago

We can split this into multiple PRs - just update the checklist with which categories are spoken for.

jdangerx commented 1 year ago

@bendnorman will flesh out some of the bullet points above as well.

bendnorman commented 1 year ago

Here are the documentation changes I'm planning on making:

Running the ETL

Cloning FERC Form1 DB

Release Notes

We could also add dagster concept explanations and uses to the Data and ETL Design Guidelines section though we could punt this part of the docs once dagster-asset-etl is merged in. @catalyst-cooperative/inframundo what do you think?

zschira commented 1 year ago

I think what you've laid out here seems like a good minimal set of docs updates before merging to dev. I think detailed explanations of dagster concepts can wait, as long as there's sufficient info to get people up and running with the new architecture. I also think doing a live run through with our team once we merge to dev will be helpful while we continue to flesh out the docs.