Chicago / clear-water

Forecasting elevated levels of E. coli at Chicago beaches to provide proper warning to beach-goers.
http://chicago.github.io/clear-water
55 stars 43 forks source link

Removing rows with NA for Client.ID #115

Closed nicklucius closed 7 years ago

nicklucius commented 7 years ago

The code in Master.R removes each row where Client.ID is NA. A lot of these rows have E. Coli readings.

Are these listed as NA for Client.ID in the data portal? If so, do we have better data in the hacknight branch? Maybe there is just an issue with the data portal data.

CallinOsborn commented 7 years ago

These NA for Client.ID comes in from when we do the clean of the beach names. For example, in 2016 we have readings for the Montrose Dog Beach. In the cleanbeachnames.csv we have that set to going to NA. There are some beaches that are now defunct, etc. If we want to look into what we are changing to NA we can, otherwise we have all the information for the beaches we are predicting in df.