singularity-energy / open-grid-emissions

Tools for producing high-quality hourly generation and emissions data for U.S. electric grids
MIT License
75 stars 5 forks source link

Add regional outputs to `data_quality_metrics` #227

Closed gailin-p closed 1 year ago

gailin-p commented 2 years ago

Currently, data_quality_metrics outputs report proportions of data sources and methods across the entire dataset. These dataset-wide averages may not be representative for smaller BAs. For example, PACW in 2019-2020/v0.1.1 data has a much lower emission rate than real-time estimates (below). When identifying the source of this discrepancy, it would be useful to be able to quickly identify what proportion of the FPC data is CEMS vs shaped EIA-923. More broadly, it would be useful to identify BAs where a majority of fossil generation is shaped EIA-923 data, because we may have lower confidence in the hourly rates in these BAs.

Screen Shot 2022-09-12 at 1 26 46 PM Difference in PACW OGE emission rate (blue) and real-time emission rate (red) (real-time rate is calculated from EIA-930 per-fuel generation and eGRID 2019 annual average emission rates). In this case, the discrepancy is due to coal generation listed for PACW in EIA-930 data (used to derive the real-time rates).

grgmiller commented 1 year ago

Closed with the v.0.2.0 release