catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License

Investigate difference in net gen from generation_eia923 vs generation_fuel_eia923 tables #750

Closed cmgosnell closed 3 years ago

cmgosnell commented 3 years ago

gen table = generation_eia923
gen_fuel table = generation_fuel_eia923

I assume this is all a vestige of reporting requirements.

zaneselvans commented 3 years ago

Have you made a plot of the annual ratio of generation reported in one table vs the other? Or even the monthly ratio, since these numbers are reported monthly? Like does it go to ~1.0 in 2014 when the combustion turbines start reporting?
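For reference, here is a minimal sketch of the ratio being asked about: total net generation in the gen table divided by total net generation in the gen_fuel table, per month or per year. It uses only the standard library and made-up toy rows rather than the real database. The tuple layout (plant_id_eia, report month, net_generation_mwh) and the helper names `total_by_period` and `ratio_by_period` are assumptions for illustration, not PUDL functions.

```python
from collections import defaultdict

# Toy rows standing in for the two tables (all values are made up).
# Each tuple: (plant_id_eia, report month as "YYYY-MM", net_generation_mwh)
gen_rows = [
    (1, "2013-01", 100.0),
    (1, "2013-02", 120.0),
    (1, "2014-06", 90.0),
    (2, "2014-06", 30.0),
]
gen_fuel_rows = [
    (1, "2013-01", 100.0),
    (2, "2013-01", 100.0),
    (1, "2013-02", 120.0),
    (2, "2013-02", 40.0),
    (1, "2014-06", 90.0),
    (2, "2014-06", 30.0),
]


def total_by_period(rows, period_len):
    """Sum net generation per period; period_len=7 is monthly, 4 is annual."""
    totals = defaultdict(float)
    for _plant_id, month, mwh in rows:
        totals[month[:period_len]] += mwh
    return dict(totals)


def ratio_by_period(gen, gen_fuel, period_len=7):
    """Ratio of gen-table to gen_fuel-table net generation for each period."""
    gen_totals = total_by_period(gen, period_len)
    fuel_totals = total_by_period(gen_fuel, period_len)
    ratios = {}
    for period in sorted(fuel_totals):
        if fuel_totals[period] == 0:
            continue  # avoid dividing by zero when gen_fuel reports nothing
        ratios[period] = gen_totals.get(period, 0.0) / fuel_totals[period]
    return ratios


monthly = ratio_by_period(gen_rows, gen_fuel_rows, period_len=7)
annual = ratio_by_period(gen_rows, gen_fuel_rows, period_len=4)
print(monthly)
print(annual)
```

In this toy data, plant 2 only starts appearing in the gen table in 2014, so the ratio sits below 1.0 in 2013 and reaches 1.0 in 2014, which is the pattern the question is looking for.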

cmgosnell commented 3 years ago

It is kinda hard to see in one graph: image

But here they are broken out by these chunks of time: image image

It doesn't entirely go to zero post-2014, but more or less.

zaneselvans commented 3 years ago

I was thinking just a line plot, with time on the x axis, and ratio on the y axis.

cmgosnell commented 3 years ago

image

(I don't have a few years of old data in my db right now for some reason)

zaneselvans commented 3 years ago

Okay great, so it seems like the overwhelming majority of the issue here is the lack of combustion turbine reporting prior to May 2014.

cmgosnell commented 3 years ago

Hm, well, no. That plot only covered the plants which had some reporting to the gen table. I remade it w/ a note, and made a second one w/ all of the missing net gen. Maybe it is going up over these last few years due to more gas on the system?

image
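To make the distinction between the two plots concrete, here is a rough sketch with made-up numbers. It computes the annual gen / gen_fuel ratio twice: once restricted to plants that report anything to the gen table, and once against all plants in gen_fuel. The row layout and the function name `annual_ratios` are hypothetical and only for illustration.

```python
from collections import defaultdict

# Toy rows (made-up values): (plant_id_eia, "YYYY-MM", net_generation_mwh)
gen_rows = [
    (1, "2013-01", 100.0),
    (1, "2014-01", 100.0),
    (3, "2014-01", 50.0),
]
gen_fuel_rows = [
    (1, "2013-01", 100.0),
    (2, "2013-01", 100.0),  # plant 2 never reports to the gen table
    (3, "2013-01", 50.0),
    (1, "2014-01", 100.0),
    (2, "2014-01", 100.0),
    (3, "2014-01", 50.0),
]


def annual_ratios(gen, gen_fuel):
    """Return {year: (restricted_ratio, all_plants_ratio)}.

    restricted_ratio counts only gen_fuel rows from plants that appear
    somewhere in the gen table; all_plants_ratio counts every gen_fuel row.
    """
    gen_plants = {plant_id for plant_id, _month, _mwh in gen}
    gen_total = defaultdict(float)
    fuel_restricted = defaultdict(float)
    fuel_all = defaultdict(float)
    for _plant_id, month, mwh in gen:
        gen_total[month[:4]] += mwh
    for plant_id, month, mwh in gen_fuel:
        year = month[:4]
        fuel_all[year] += mwh
        if plant_id in gen_plants:
            fuel_restricted[year] += mwh
    ratios = {}
    for year in sorted(fuel_all):
        restricted = (
            gen_total[year] / fuel_restricted[year]
            if fuel_restricted[year]
            else None
        )
        overall = gen_total[year] / fuel_all[year] if fuel_all[year] else None
        ratios[year] = (restricted, overall)
    return ratios


ratios = annual_ratios(gen_rows, gen_fuel_rows)
for year, (restricted, overall) in ratios.items():
    print(year, restricted, overall)
```

In this toy case the restricted ratio reaches 1.0 in 2014 while the all-plants ratio stays well below it, because plant 2's generation only ever shows up in gen_fuel. That is the kind of gap the second plot is meant to expose.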