-
The PUDL Kaggle dataset last updated on August 24th, but it's supposed to pull new data every Monday from the S3 bucket, so something is broken.
I attempted a manual update and it failed with an erro…
-
### Overview
In order to better trace the development of PUDL, the success of our outreach efforts and the effects of our new Superset instance, we need to revitalize the `pudl-usage-metrics` repos…
-
Before going full steam ahead on implementation, it makes sense to think through the design a bit to see if there are any avoidable pitfalls.
Doc is [here](https://docs.google.com/document/d/1ot3eFBw…
-
--WIP--
### Description
This epic coordinates the integration of the Vibrant Pattern Energy Renewable Generation datasets. It contains estimated county-averaged hourly capacity factor for wind and so…
-
With the update to the `conda-forge` feedstock in #465 I've just upgraded from v3.0.0 to v3.2.1 of `sphinx-autoapi`, and am now getting errors like these when I run `sphinx-build`, which I never got b…
-
As previously addressed in e.g. #2417, #2996, #3003, #3208, and #3211, SQLite can't handle multiple concurrent writes. Our SQLite IO Manager has worked around this for the PUDL DB, but we seem to be…
-
## PHMSA distribution data (1990-present)
PHMSA distribution data explains the extent, safety record, and characteristics of each operator's distribution system by state and commodity group. The fi…
-
Kaggle kernel (notebook) metrics are available through the Kaggle API using the following bash command:
`kaggle kernels list --page-size 200 --dataset catalystcooperative/pudl-project -v`
I've pok…
-
The `fix/8` branch introduces changes that address issue #8 and downstream user concerns (e.g. https://github.com/catalyst-cooperative/pudl/issues/3531). However, the changes made in this branch haven…
-
Superset does not support loading data from sqlite so we want to use duckdb instead! Duckdb is well suited for our data because it's designed to handle local data warehouses. It's also a cheaper optio…