-
Schedule 8 of EIA 923 contains valuable monthly information about the operation, cost, and status of environmental equipment. Let's bring it in!
```[tasklist]
### Extract raw data
- [ ] http…
-
Some of our output tables mostly just grab a table and denormalize it (merge the corresponding entity table into it), but sometimes we also impute or fill in data.
There are also two main ways we acce…
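
For reference, a minimal sketch of the "grab a table and denormalize it" pattern described above, using plain pandas. The table and column names here (`generation`, `plants_entity`, `plant_id`, and so on) are made up for illustration and are not the actual PUDL tables.

```python
import pandas as pd

# Hypothetical normalized data table: one row per plant per report date.
generation = pd.DataFrame(
    {
        "plant_id": [1, 1, 2, 3],
        "report_date": pd.to_datetime(
            ["2021-01-01", "2021-02-01", "2021-01-01", "2021-01-01"]
        ),
        "net_generation_mwh": [100.0, 120.0, 80.0, 55.0],
    }
)

# Hypothetical entity table: static attributes, one row per plant.
plants_entity = pd.DataFrame(
    {
        "plant_id": [1, 2],
        "plant_name": ["Plant A", "Plant B"],
        "state": ["CO", "TX"],
    }
)

# Denormalize: a left merge keeps every data row, and validate= raises if
# the entity table unexpectedly has duplicate plant_id values.
denorm = generation.merge(
    plants_entity, on="plant_id", how="left", validate="many_to_one"
)

# Rows whose plant has no entity record end up with missing attributes.
missing_entity = denorm[denorm["plant_name"].isna()]
```

The left merge is a deliberate choice in this sketch: it never drops data rows, so gaps in the entity table show up as nulls rather than as silently missing records.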
-
Get new permalinks from
http://pudl.princeton.edu/collection.php?c=papyri&sort=title&rpp=247&start=0
In APIS, search for ```AM8957```
-
The `source.discover()` method shows some details about the internals of a data source within an intake catalog. E.g.
```py
import intake
pudl_cat = intake.cat.pudl_cat
pudl_cat.hourly_emissions_e…
-
We have integrated most of the tables from EIA 860 and 923, but we're still missing several. This issue collects all tables that are still missing, so we can keep track of our progress towards complet…
-
Our initial PUDL ETL DAG has a couple of clear performance bottlenecks, and a few assets or asset groups with more dependencies than are really necessary. Modest refactoring of some of these assets or…
-
Dependabot has been faithfully opening pull requests but they fail with this error message ([logs](https://github.com/catalyst-cooperative/pudl-usage-metrics/actions/runs/3637887707/jobs/6139426645)):…
-
-
The main issue with integrating truly monthly data from YTD sources is in aggregating. If we have only a quarter of data and `freq="AS"` in a `pudl.output.pudltabl.PudlTabl` object, then we need to do…
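
To make the aggregation problem concrete, here is a small pandas sketch. It assumes made-up data and column names (`ytd`, `monthly_recovered`), and it uses the `"YS"` offset alias, which is the year-start frequency that `"AS"` refers to. Flagging incomplete years by counting months is only one possible approach, not necessarily what this issue ends up proposing.

```python
import pandas as pd

# Hypothetical data: a full year of 2021 plus only the first quarter of 2022.
dates = pd.date_range("2021-01-01", "2022-03-01", freq="MS")
monthly_true = [10.0] * 12 + [10.0, 15.0, 20.0]
df = pd.DataFrame({"report_date": dates, "monthly": monthly_true})

# Simulate a year-to-date source: values accumulate within each calendar year.
year = df["report_date"].dt.year
df["ytd"] = df.groupby(year)["monthly"].cumsum()

# Recover monthly values by differencing within each year. The first month
# of each year has no predecessor, so its YTD value is the monthly value.
df["monthly_recovered"] = df.groupby(year)["ytd"].diff().fillna(df["ytd"])

# Aggregate to annual frequency and count how many months fed each total.
annual = (
    df.set_index("report_date")["monthly_recovered"]
    .resample("YS")
    .agg(["sum", "count"])
)

# A year with fewer than 12 months is only a partial total.
annual["complete_year"] = annual["count"] == 12
```

In this sketch the 2022 row sums only three months, so reporting it next to the 2021 total without the `complete_year` flag would make it look like a sharp drop.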
-
Reading parquet files which are stored on the local filesystem through the current PUDL catalog still results in caching. This slows things down dramatically, and quickly uses an enormous amount of di…