PyPSA / powerplantmatching

Set of tools to combine multiple power plant databases
https://powerplantmatching.readthedocs.io/en/latest/
GNU General Public License v3.0
156 stars 54 forks source link

Update IRENASTAT to 2023 data #157

Closed koen-vg closed 6 months ago

koen-vg commented 6 months ago

Change proposed in this Pull Request

Update IRENASTAT source to 2023, with accompanying changes to accommodate for slight updates in format.

Mostly this just adds the three years of 2021, 2022 and 2023. Some of data from 2019 and 2020 also have been ever so slightly revised in this new version. See for example:

Master branch:

>>> df = pm.data.IRENASTAT()
>>> df.loc[(df.Country == "Germany") & (df.Technology == "PV")]
          Grid Technology  Year  Country Fueltype  Capacity
24864  On-grid         PV  2000  Germany    Solar     114.0
24904  On-grid         PV  2001  Germany    Solar     195.0
24948  On-grid         PV  2002  Germany    Solar     260.0
24994  On-grid         PV  2003  Germany    Solar     435.0
25041  On-grid         PV  2004  Germany    Solar    1105.0
25091  On-grid         PV  2005  Germany    Solar    2056.0
25147  On-grid         PV  2006  Germany    Solar    2899.0
25211  On-grid         PV  2007  Germany    Solar    4170.0
25282  On-grid         PV  2008  Germany    Solar    6120.0
25364  On-grid         PV  2009  Germany    Solar   10564.0
25458  On-grid         PV  2010  Germany    Solar   18004.0
25561  On-grid         PV  2011  Germany    Solar   25914.0
25685  On-grid         PV  2012  Germany    Solar   34075.0
25819  On-grid         PV  2013  Germany    Solar   36708.0
25966  On-grid         PV  2014  Germany    Solar   37898.0
26124  On-grid         PV  2015  Germany    Solar   39222.0
26289  On-grid         PV  2016  Germany    Solar   40677.0
26462  On-grid         PV  2017  Germany    Solar   42291.0
26636  On-grid         PV  2018  Germany    Solar   45156.0
26819  On-grid         PV  2019  Germany    Solar   49045.0
27006  On-grid         PV  2020  Germany    Solar   53781.0

This PR:

>>> df = pm.data.IRENASTAT()
>>> df.loc[(df.Country == "Germany") & (df.Technology == "PV")]
            Grid Technology Fueltype  Country  Year  Capacity
77088   Off-grid         PV    Solar  Germany  2000       0.0
77089   Off-grid         PV    Solar  Germany  2001       0.0
77090   Off-grid         PV    Solar  Germany  2002       0.0
77091   Off-grid         PV    Solar  Germany  2003       0.0
77092   Off-grid         PV    Solar  Germany  2004       0.0
77093   Off-grid         PV    Solar  Germany  2005       0.0
77094   Off-grid         PV    Solar  Germany  2006       0.0
77095   Off-grid         PV    Solar  Germany  2007       0.0
77096   Off-grid         PV    Solar  Germany  2008       0.0
77097   Off-grid         PV    Solar  Germany  2009       0.0
77098   Off-grid         PV    Solar  Germany  2010       0.0
77099   Off-grid         PV    Solar  Germany  2011       0.0
77100   Off-grid         PV    Solar  Germany  2012       0.0
77101   Off-grid         PV    Solar  Germany  2013       0.0
77102   Off-grid         PV    Solar  Germany  2014       0.0
77103   Off-grid         PV    Solar  Germany  2015       0.0
77104   Off-grid         PV    Solar  Germany  2016       0.0
77105   Off-grid         PV    Solar  Germany  2017       0.0
77106   Off-grid         PV    Solar  Germany  2018       0.0
77107   Off-grid         PV    Solar  Germany  2019       0.0
77108   Off-grid         PV    Solar  Germany  2020       0.0
77109   Off-grid         PV    Solar  Germany  2021       0.0
77110   Off-grid         PV    Solar  Germany  2022       0.0
77111   Off-grid         PV    Solar  Germany  2023       0.0
179232   On-grid         PV    Solar  Germany  2000     114.0
179233   On-grid         PV    Solar  Germany  2001     195.0
179234   On-grid         PV    Solar  Germany  2002     260.0
179235   On-grid         PV    Solar  Germany  2003     435.0
179236   On-grid         PV    Solar  Germany  2004    1105.0
179237   On-grid         PV    Solar  Germany  2005    2056.0
179238   On-grid         PV    Solar  Germany  2006    2899.0
179239   On-grid         PV    Solar  Germany  2007    4170.0
179240   On-grid         PV    Solar  Germany  2008    6120.0
179241   On-grid         PV    Solar  Germany  2009   10564.0
179242   On-grid         PV    Solar  Germany  2010   18004.0
179243   On-grid         PV    Solar  Germany  2011   25914.0
179244   On-grid         PV    Solar  Germany  2012   34075.0
179245   On-grid         PV    Solar  Germany  2013   36708.0
179246   On-grid         PV    Solar  Germany  2014   37898.0
179247   On-grid         PV    Solar  Germany  2015   39222.0
179248   On-grid         PV    Solar  Germany  2016   40677.0
179249   On-grid         PV    Solar  Germany  2017   42291.0
179250   On-grid         PV    Solar  Germany  2018   45156.0
179251   On-grid         PV    Solar  Germany  2019   48912.0
179252   On-grid         PV    Solar  Germany  2020   53669.0
179253   On-grid         PV    Solar  Germany  2021   60036.0
179254   On-grid         PV    Solar  Germany  2022   67477.0
179255   On-grid         PV    Solar  Germany  2023   81737.0

I created a new record on Zenodo with the new dataset, downloaded from the same source as before: https://zenodo.org/records/10952917. Feel free however to move this to the zenodo record containing the old 2000-2020 dataset.

The new dataset is about 22MB as opposed to 9MB for the old one. This seems to be due to the fact that the old one had many (but all) zero entries removed. If desirable for space/network-saving reasons, I could also remove zero-entries from the new dataset.

Type of change

Checklist