singularity-energy / open-grid-emissions

Tools for producing high-quality hourly generation and emissions data for U.S. electric grids
MIT License
72 stars 5 forks source link

Validate Partial CEMS methodology for subplants #247

Open grgmiller opened 1 year ago

grgmiller commented 1 year ago

During peer-review of the hourly shaping methodology, a question was raised about whether the assumption behind the partial cems shaping methodology is robust (namely that all units within a subplant have similar operational profiles, and that all subplants within a plant have similar operational profiles).

For example:

On the greg/research branch I have started to explore this issue by examining the correlation coefficient between the hourly fuel consumption profile across units within each subplant, and across subplants within each plant. This preliminary examination revealed that for 2020, the mean correlation coefficient of the hourly fuel consumption of different units within a subplant is 0.67, and for subplants within a plant is 0.39. This suggests that perhaps the partial_cems_plant shaping method is less robust.

However, this raises the question of whether the fleet-average residual profile more accurately reflects the profile of that missing generation than the plant-specific average generation profile? In other words, even if this assumption isn't great, are the alternatives any better? I'm not sure that we could validate that. Maybe one approach to try and cross validate would be to ask how well correlated the generation profile of each specific subplant is with the fleet average generation profile of all subplants that report to CEMS? i.e. is the average representative of each unique unit?