-
Before we can remove some of the metadata structures that remain in `constants.py` and move completely to the new Pydantic metadata system, we need to compile complete metadata for the `ferc714` da…
-
## Where is the Data Source metadata now?
* `pudl/metadata/sources.py` (where we're trying to consolidate it)
* `pudl-zenodo-storage/zs/metadata.py`
* Some subcomponents are defined in `pudl/metadat…
-
Once we've successfully mapped technology descriptions onto the ferc1 small generators table, we'll need to decide how to fill in the remaining gaps. Manually?
-
We still haven't archived the outputs of the hourly state-level demand allocation project or the historical balancing authority and utility service territories. Might as well use the fresh v0.4.0 so…
-
A review of the field metadata in `src/pudl/package_data/meta/datapkg/datapackage.json` reveals the following inconsistencies in `constraints.enum` and `description` for fields of the same name across…
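A check like this can be automated. The following is a minimal sketch, assuming the file follows the usual data package layout (`resources` → `schema` → `fields`, with optional `constraints.enum` on each field). The function name `find_inconsistent_fields` and the example tables are hypothetical and are not part of PUDL.

```python
from collections import defaultdict


def find_inconsistent_fields(datapackage: dict) -> dict:
    """Return fields whose description or enum differs between resources."""
    seen = defaultdict(lambda: {"description": set(), "enum": set()})
    for resource in datapackage.get("resources", []):
        for field in resource.get("schema", {}).get("fields", []):
            entry = seen[field["name"]]
            entry["description"].add(field.get("description"))
            enum = field.get("constraints", {}).get("enum")
            # Sort the enum so that ordering differences alone are not reported.
            entry["enum"].add(
                tuple(sorted(enum, key=str)) if enum is not None else None
            )
    # Keep only the attributes that take more than one distinct value.
    return {
        name: {key: values for key, values in entry.items() if len(values) > 1}
        for name, entry in seen.items()
        if any(len(values) > 1 for values in entry.values())
    }


# Hypothetical two-table example; the real input would be the parsed JSON file.
example = {
    "resources": [
        {"name": "table_a", "schema": {"fields": [
            {"name": "fuel_type", "description": "Type of fuel.",
             "constraints": {"enum": ["coal", "gas"]}},
            {"name": "year", "description": "Report year."},
        ]}},
        {"name": "table_b", "schema": {"fields": [
            {"name": "fuel_type", "description": "Fuel type code.",
             "constraints": {"enum": ["gas", "coal"]}},
            {"name": "year", "description": "Report year."},
        ]}},
    ]
}
inconsistent = find_inconsistent_fields(example)
```

To run it against the real file, load the JSON with `json.load` and pass the resulting dictionary to the function. Fields that are absent from the result have a single description and a single enum across all tables.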
-
A few minor edits to `f1_respondent_id`:

New respondents in 2019:
* 529: Tri-State G & T Assn, Inc
* 531: Basin Electric Power Coop

Also, a couple of edits to names for respondents already in f1_res…
-
We don't include a few very large binary-ish tables from the original ferc1 DB in the distributed PUDL data, since they increase the size of the DB by 10x, and are of very little use to anyone as is. …
-
Add a confidence metric to each of the imputation methods.
-
If a generator is retired but has similar identifiers (fuel type, prime mover etc.), currently it will get lumped together with existing generators. The master unit list records that contain retired g…
-
1. Calculate some statistics of Billsum data:
- Avg. length of document
- Avg. length of ground truth document summary
- Avg. length of predicted document summary
2. Give sample predi…
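
The statistics in step 1 could be computed along these lines. This is a minimal sketch with two assumptions: each record is a dictionary holding the document, its ground truth summary, and the predicted summary, and length is measured in whitespace-separated tokens. The field names (`text`, `summary`, `predicted_summary`) and the function `average_lengths` are illustrative choices, not taken from the project.

```python
from statistics import mean


def average_lengths(records, fields=("text", "summary", "predicted_summary")):
    """Average whitespace-token length of each field across all records."""
    return {
        field: mean(len(record[field].split()) for record in records)
        for field in fields
    }


# Tiny made-up records, used only to show the expected input shape.
records = [
    {"text": "a b c d e f", "summary": "a b c", "predicted_summary": "a b"},
    {"text": "a b c d", "summary": "a", "predicted_summary": "a b c d"},
]
stats = average_lengths(records)
```

If character counts or model tokenizer counts are preferred, only the `len(record[field].split())` expression needs to change.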