epiforecasts / covid-us-forecasts

Forecasting Covid-19 in the US
https://epiforecasts.io/covid-us-forecasts/submissions/report.html
MIT License
8 stars 3 forks source link

Anomaly detection: round 2 #93

Open kathsherratt opened 3 years ago

kathsherratt commented 3 years ago

Comments by @seabbs

I think the missing piece of the puzzle is to go back to doing some anomaly correction but in a less hardcore fashion that we did last time. Previously, we corrected all of the data (both that used for fitting and that used when plotting) and it led us a little astray (we thought we were doing well but never saw the truth data and so never knew how we were actually doing). Adding a second anomaly cleaned data stream and using that for fitting whilst keeping the current truth data everywhere else seems like a good option. In terms of anomaly detection something fairly light seems sensible. Perhaps just having an allowed week to week change (i.e Monday to Monday and perhaps in the order of 200%) and setting to the backwards looking 7 day average if it exceeds this? The other critical thing we didn't have before was some awareness of how much and when we are doing this so flagging that and perhaps adding to the summary report seems like it would be really useful.

kathsherratt commented 3 years ago

I guess this involves:

seabbs commented 3 years ago

Nice work Kath,

Some thoughts:

kathsherratt commented 3 years ago

Nothing new but just dropping in here some useful resources for manual data sense-checks (not sure where else to keep this) https://github.com/nytimes/covid-19-data/issues?q=is%3Aissue+label%3Adata-issue+ https://github.com/CSSEGISandData/COVID-19/issues

kathsherratt commented 3 years ago

Flagging this function specifically for checking a range of methods for anomaly detection (used by Reich lab on US data): https://github.com/reichlab/covidData/blob/master/R/identify_outliers.R https://github.com/reichlab/covidData/blob/master/vignettes/outliers.R

Also, linking to #97 which looks to me like a near duplicate + expansion of this issue

kathsherratt commented 3 years ago

Pinning this issue. We will need to