Fixes issue with assignment of subplant IDs
Fixes an issue that was causing some generators/units to be assigned missing or incorrect subplant IDs. This issue caused several downstream issues including inaccurate hourly shapes being assigned to certain subplants, or inaccurate conversion of gross to net generation in the CEMS data. This patch ensures that every generator_id and unitid is associated with a non-missing subplant ID, and that these subplant assignments account for boiler-generator associations from EIA-860 (full details).
Fixes anomalous spikes in emission rate data
Several grid regions were exhibiting anomalous dips in their regional emission intensity values due to an issue with the methodology used to shape data from plants with partial CEMS data. Specifically, the generation from certain non-emitting plants (e.g. nuclear, solar, etc) that had a fossil-fuel backup generator onsite were being assigned the intermittent hourly profile of the backup generator if that generator reported to CEMS. This resulted in data quality issues in both the generated and consumed emission rates for some regions. This patch fixes that issue by excluding all non-emitting generators and plants with subplants of mixed fuel types from using the partial CEMS methodology. (full details).
Other updates
Fixes an issue that was resulting in an infeasible conda environment by updating our dependency on the Public Utility Data Liberation Project to a stable branch of the project.
Improves the speed of running the part of the data pipeline that identifies subplant IDs.
Updates the plant_metadata.csv file to help users more easily identify the methodologies used for each plant.
Adds adjusted R2 values to the gross to net generation regression outputs available in the data/outputs/gross_to_net_conversions.csv file.
Renames the data/outputs/subplant_crosswalk.csv file to subplant_crosswalk_[YEAR].csv to clarify that subplant IDs are only valid for a specific year.
Merges v0.1.2 patch updates into main.
Fixes issue with assignment of subplant IDs Fixes an issue that was causing some generators/units to be assigned missing or incorrect subplant IDs. This issue caused several downstream issues including inaccurate hourly shapes being assigned to certain subplants, or inaccurate conversion of gross to net generation in the CEMS data. This patch ensures that every
generator_id
andunitid
is associated with a non-missing subplant ID, and that these subplant assignments account for boiler-generator associations from EIA-860 (full details).Fixes anomalous spikes in emission rate data Several grid regions were exhibiting anomalous dips in their regional emission intensity values due to an issue with the methodology used to shape data from plants with partial CEMS data. Specifically, the generation from certain non-emitting plants (e.g. nuclear, solar, etc) that had a fossil-fuel backup generator onsite were being assigned the intermittent hourly profile of the backup generator if that generator reported to CEMS. This resulted in data quality issues in both the generated and consumed emission rates for some regions. This patch fixes that issue by excluding all non-emitting generators and plants with subplants of mixed fuel types from using the partial CEMS methodology. (full details).
Other updates
plant_metadata.csv
file to help users more easily identify the methodologies used for each plant.data/outputs/gross_to_net_conversions.csv
file.data/outputs/subplant_crosswalk.csv
file tosubplant_crosswalk_[YEAR].csv
to clarify that subplant IDs are only valid for a specific year.