Closed padilla410 closed 2 years ago
DRAFT
I am actively working on this issue right now. There are enough small issues with the data that I am going to track things here on a year by year basis to help with the final munge.
Data Quality Review
Global issues
Missouri_USACE_samplingLocations.xlsx
(e.g., PT24
in the sampling locations spreadsheet but PT-24
in the data spreadsheet). There is also inconsistent use of case.PE_dam
from 2010 doesn't have a location)2009
2010
LB
)PT
)PE_dam
) that matches the site list in
Missouri_USACE_samplingLocations.xlsx
2011
2012
2013
2014
2015
2016
PT
, ST
, PO
, ME
, WI
)HT
) but site is always in the first column2017
TC
row 92 in Missouri_USACE_2017_profiles.xlsx
). This seems like a small enough component of the data that I am going to ignore it (2/4398 records)2018
LB
) has missing column names at the top. Needs modified wrangling 2019
2020
2021
US Army Corps of Engineers (USACE) reservoir profile data from 2009-2021. Data includes the following:
Other parameters (not included in all data sets):
For a given year, the data is organized by reservoir, with multiple locations for each reservoir. Lat/longs are provided in a separate excel file called
Missouri_USACE_samplingLocations.xlsx
. The format of the data is optimized for "in-workbook" data analysis and some workbooks contain figures.Note - there is likely sampling location overlap between this data set and the following data sets: