Closed jdechalendar closed 5 months ago
@zaneselvans I think the plots here can be used to provide some answers to questions you had asked about how much of an adjustment was being made by the different data processing tools in this repo, especially the data-reconciliation step.
Summary of changes
ge_download --bulk
downloads the bulk dataset that was created in #20. Files are in 6-month chunks.gridemissions.load_bulk("elec")
loads the electricity dataset. The six-month chunks are merged by this function.Changes made by our data processing tools to the raw datasets
Below is some output from the corresponding jupyter notebooks. Future work: add some of this content to energy.stanford.edu/gridemissions.
Note: before running these notebooks, you need to download the bulk datasets, which you can now do with
ge_download --bulk
.Figs 2-5 from de Chalendar and Benson 2021.ipynb
Paper on the arXiv. I added dropdowns to slice data by year.
Figure 2 from the paper for 2018 below.
Figure 2 from the paper for 2023 below.
Figures 3-5 (not shown here but available in the notebook) summarize adjustments for interchange data and generation by source.
Cleaning plot.ipynb
gridemissions.viz.reports.cleaning_plot
is a plotting function I used when I initially developed this tool. I updated it as part of this PR and am showing how I use it in a notebook. BPAT is one of the balancing areas whose data is most changed by the data reconciliation algorithm (I have not thoroughly investigated why).The
cleaning_plot
function has different options. I am showing two options below for the BPAT example. The red line in the first subplot below indicates a large mismatch (up to 5GW) between supply, demand and net interchangeBy passing the
other
kwarg, I can compare two different datasets. Here I am comparing the raw dataset to the final dataset. Note that this function can also be used to compare intermediate datasets in the gridemissions workflow. This can get quite messy, so I restricted the x-axis to 2021 here. The red line for "after" is now at 0 -- which is the expected result from the data reconciliation algorithm.Plot adjustments.ipynb
A dropdown can be used to select any timeseries and see how it was adjusted. For example: BPAT hydro.
Commit log