-
# Description
Currently we provide access to more human-readable denormalized outputs using software routines. This adds a layer of complexity and requires users to use Python. It's also kind of slow…
-
Scope:
* [x] merge `dev` into `dagster_asset_etl`
* [ ] write script (potentially using sqlitediff) to compare legacy/dagster pudl.sqlites.
-
@zschira commented on [Tue Sep 13 2022](https://github.com/catalyst-cooperative/pudl-scrapers/issues/53)
Once the archiver/scraper repos have been combined, and we have high level scripts for managin…
-
Dagster introduces a lot of new concepts and code to PUDL. Make sure we continue the tradition of super rad documentation.
- [x] Release notes
- [x] Overview of dagster concepts: Resources, ops, j…
-
multi_assets require you to specify the outputs in the mutli_asset decorators. We have some multi_assets that output dozens of assets so we'll need a way to look up table names by multi_asset name.
…
-
Requirements:
* [x] The new ETL settings file should allow us to specify inputs for both the FERC1 DB cloning and the main PUDL ETL, with the possibility of adding new sections as we incorporate ad…
-
As soon as the nightly builds succeed on `dev` we'll be ready to merge into `main` and tag a new PUDL release that includes all 2021 data for all of our covered datasets.
## Release Checklist / Not…
-
Many utilities identified in FERC, EIA, and EPA data are subsidiaries owned by some larger utility holding company. Understanding these ownership relationships can be helpful in understanding the econ…
-
Create dagster assets for the raw and transformed FERC 714 data so the partially cleaned tables can be accessed in the pudl.sqlite DB. Once the ETL has been converted to dagster #2104 we can start to …
-
PUDL creates an id called a `subplant_id` in the `analysis/epacamd_eia.py` module. In short, this id identifies unique operating entities (combustor-generator combinations) within reported `plant_id_e…