Closed dt-woods closed 4 months ago
thanks - good spot. I will take a closer look as soon as I can but I expect we can get this updated.
For some reason we were missing that column from the 2020 and 2021 files, perhaps because the name changed, I'm not quite sure. I will add that we use the file that I edited in 39a49bb to ensure consistent field names across years, which we realized a few years back when we started adding more years. Not the cleanest system, but it should ensure the same column names across data years for the facilities file (this specific issue notwithstanding)
I've confirmed that column has been added for 2020 and 2021. I will pull into master but may not be able to update data commons immediately w/ a new release for egrid data (v1.1.3).
thank you for reporting @dt-woods!
I appreciate the quick turnaround, Ben, and look forward to the release of the new revision.
@dt-woods the new processed egrid files for 2020 and 2021 are now up in data commons for v1.1.3
Stewi's getInventoryFacilities method for "eGRID" returns a data frame using row 1's column names from "PLNT16" or "PLNT20" worksheet of the respectively downloaded Excel workbook (e.g., 'egrid2016.xlsx' and 'eGRID2020_Data_v2.xlsx'); however, these names fail a consistency check between 2016 and 2020. Row 2 of the worksheet includes a keyword for the column, which appears to be consistent. This creates a challenge for data users when dealing with multi-annual datasets (i.e., I have to write a check for multiple column names rather than a single check against the keyword).
Testing Stewi's getInventoryFacilities method for "eGRID" 2020, it seems to be missing the primary fuel category.
Reproducible example:
In its respective data file, the W column "Plant primary coal/oil/gas/ other fossil fuel category" represents the "PLFUELCT" keyword I am looking for in eGRID 2016.
In 2020, the now Y column name is changed to "Plant primary fuel category" (note the keyword is still "PLFUELCT") and, for some reason, is not provided in the data frame.
Would it be possible to include the PLFUELCT column in all eGRID facility datasets?