Add catalystcoop.pudl_catalog to our JupyterHub

catalyst-cooperative / pudl-catalog

An Intake catalog for distributing open energy system data liberated by Catalyst Cooperative.

MIT License

Once we have an initial release of the data catalog on PyPI / conda-forge (see #10):

[x] Add catalystcoop.pudl_catalog to the environment specified in the pudl-examples repo which builds the Docker container for our JupyterHub.
[x] Test accessing the cloud data on the JupyterHub using both pd.read_parquet() and the Intake catalog, for both the monolithic and partitioned datasets, with and without caching turned on, to see what the user experience is like.
[x] Make this the default way of accessing the EPA CEMS data on the JupyterHub, so we can reduce our disk usage and avoid the hassle of uploading new data to the hub whenever we do a data release.
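
For the access test in the checklist, a small timing harness may help compare the combinations (pd.read_parquet() vs. the Intake catalog, monolithic vs. partitioned, cached vs. uncached). The following is only a sketch: `time_loaders` and the loader names are hypothetical helpers, not part of pudl_catalog, and the stand-in loaders in the demo would need to be replaced with real calls against the cloud data.

```python
import time
from typing import Any, Callable, Dict, List


def time_loaders(
    loaders: Dict[str, Callable[[], Any]], repeats: int = 2
) -> Dict[str, List[float]]:
    """Call each loader `repeats` times and record wall-clock seconds per call.

    With caching turned on, the first call is the cold read and later calls
    should hit the cache, so comparing them gives a rough feel for the
    user experience in each case.
    """
    timings: Dict[str, List[float]] = {}
    for name, load in loaders.items():
        runs = []
        for _ in range(repeats):
            start = time.perf_counter()
            load()  # result is discarded; only elapsed time is of interest
            runs.append(time.perf_counter() - start)
        timings[name] = runs
    return timings


if __name__ == "__main__":
    # Stand-in loaders so the sketch runs anywhere. On the JupyterHub these
    # would be replaced with real reads, for example (assumed, not verified
    # here): lambda: pd.read_parquet(some_parquet_url, columns=[...])
    demo_loaders = {
        "read_parquet_monolithic": lambda: time.sleep(0.01),
        "intake_partitioned": lambda: time.sleep(0.02),
    }
    for name, runs in time_loaders(demo_loaders).items():
        print(name, [f"{seconds:.3f}s" for seconds in runs])
```

Passing zero-argument callables keeps the harness independent of how each dataset is opened, so the same loop can cover every combination listed above.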
