CityOfLosAngeles / covid19-indicators

Key COVID-19 and public health indicators for reopening
Apache License 2.0
8 stars 1 forks source link

Add function to deal with outliers #75

Closed tiffanychu90 closed 3 years ago

tiffanychu90 commented 3 years ago

In LA, on 5/27/21, it was recorded that there were 4,000+ new cases from the prior day. It's reflected in the raw data in JHU. Rather than suppressing individual outlier dates across counties, use a function that compares the new cases of a given day against new cases of prior day and the day after. If new cases recorded are more than 10x the previous day's or the day after's, then exclude that as outlier. Otherwise, outliers mess up our 7-day rolling average.