catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License
456 stars 106 forks source link

Make a complete, re-usable boiler-generator association table. #107

Closed zaneselvans closed 6 years ago

zaneselvans commented 6 years ago

We can create a re-usable and complete (or at least, as good as it gets) boiler generator association table that we use instead of the EIA860 table -- using inter-year information where it's available. Will be useful going forward. Most of this work has been done in the MCOE work for CPI... but needs to be integrated into one thing.

cmgosnell commented 6 years ago

We currently have a temporary function which generates a complete bga, although we'd like to move this into the ingest process as a data table.

Because it relies on data in the database, several things need to happen before that:

zaneselvans commented 6 years ago

The mcoe.boiler_generator_association() function is looooooong, but now does a good job of creating boiler-generator associations, by creating unit_ids based on the full list of historically associated boilers and generators, using a simple network analysis. This function can hopefully be adapted pretty directly to the ingest process, and the addition of a unit_id to the boiler and generator entity tables, allowing them to be joined simply when needed.

cmgosnell commented 6 years ago

whoop! finally done. bga is a table and there is no longer a bga function in mcoe.py