catalyst-cooperative / electricity-demand-mapping

Tools for geographic reallocation of estimated hourly US electricity demand.
MIT License
15 stars 2 forks source link

Link historical hourly demand to geography #1

Closed zaneselvans closed 4 years ago

zaneselvans commented 4 years ago

The main underlying task in this project is to link historical hourly electricity demand to the geographic region in which that demand occurred. Unfortunately, this isn't very straightforward, but we have a bunch of building blocks to work with:

zaneselvans commented 4 years ago

The methodology outlined by Auffhammer et al. (2017) seems promising, but the publication seems to be missing some details. They are trying to link the entities reporting load data (in FERC 714 by planning area) to the utilities listed in EIA 861, which are associated with particular states + counties.

They start with the list of all utilities reporting hourly demand, and exclude the following:

Question: How do they know whose data is is duplicated by other FERC-714 respondents?

For many of the respondents that appear in the FERC data, FERC provides an EIA utility ID. This ID can be used to directly look up the counties served by that utility in the EIA 861.

Question: How do we know it is valid to assume a utility's service territory (recorded in EIA 861) is the same as the "planning area" which that utility administers?

A smaller number of the EIA Utility IDs listed in the FERC 714 data did not map directly to a utility with an indicated service territory, but did map to a utility listed as operating a balancing authority, and (apparently?) EIA 861 provides a mapping of individual utilities with retail service territories to the corresponding utility entity that is the balancing authority, allowing the individual constituent utilities listed in the FERC 714 planning area descriptions to be indirectly linked to a set of counties. (Note that there is substantial overlap in the areas of these balancing authorities, and the individual utility service areas identified in the step above.)

Looking into the historical EIA 861 data, File1.xls starting in 1999, and later on file1_cao_YYYY.xls and finally in 2012 balancing_authority_YYYY.xlsx include mappings of individual EIA utility IDs to the EIA utility IDs of the entities operating their balancing area. However, from 2013 onward, the Balancing Authority code is no longer a utility ID, and instead becomes a string abbreviation. So far as I can tell this string abbreviation doesn't show up anywhere else in the EIA 861 data, but it is associated with individual plants in the EIA 860 data -- so it seems like there was some reorganization of this data on the EIA side, giving each EIA plant a BA instead of each little utility a BA. Which I guess probably makes more sense.

Question: Given the substantial overlap in these territories, what's the right way to allocate demand? What makes us think that the same demand isn't being reported multiple times by different entities

For the remaining utilities reporting hourly demand in FERC 714, which were not mapped to utilities and the counties they serve through the above processes, the authors manually mapped the names of the utilities making up the reporting planning area to individual utilities reported in EIA 861 based on their names alone.

gschivley commented 4 years ago

I’ll try to get my notebook of an initial mapping up later today or tomorrow. I also reached out to Max Auffhammer and he is willing to answer specific questions. A few things I’m running up against so far:

zaneselvans commented 4 years ago

@gschivley My understanding is that in the Auffhammer paper, they were trying to figure out "What location is associated with this demand?" as opposed to "What demand is associated with this location." -- does that seem right to you? Figuring out what all demand is associated with a county seems harder, since you want to try and avoid double counting, or missing demand. It seemed like in their maps there were a lot of counties that showed up associated with more than one of the respondents.

zaneselvans commented 4 years ago

Potential Demand Mapping Method:

Assumptions:

Process:

For each FERC 714 Respondent that reports hourly demand, we must identify the set of County FIPS IDs associated with that respondent.

Sanity Checks / Success Metrics:

gschivley commented 4 years ago

I finally cleaned up (a little) the work I've done and posted it as a notebook. Can be downloaded or run through colab/mybinder.