cmu-delphi / covidcast-indicators

Back end for producing indicators and loading them into the COVIDcast API.
https://cmu-delphi.github.io/delphi-epidata/api/covidcast.html
MIT License
12 stars 17 forks source link

Consider changing JHU indicator to not report incidence zeros #1757

Open krivard opened 1 year ago

krivard commented 1 year ago

Zeros appear in JHU incidence signals when the cumulative number of cases or deaths remains the same between one day and the next. This does not necessarily mean that zero new cases or deaths occurred on that day: many regions have downgraded their reporting cadence, so that all the new cases or deaths that occurred in a week are collected together and reported in a batch on e.g. Thursdays.

Alas, this also does not necessarily mean that zero new cases or deaths did not occur on that day: small regions often go many weeks between new actual cases or deaths, so many/most/all of those zeros are true zeros.

It seems like a maintenance nightmare to try and keep accurate track of which regions are expected to report on which days so that we can suppress reporting of zeros on other days. It also seems like it would be difficult to consistently decide whether a region was "small enough" that their zeros were true zeros.

We should switch to never reporting zeros in incidence signals from JHU. It is much less harmful for a researcher to misinterpret the days before a small count in a small region as "not reported" days than for that researcher to misinterpret a large region going from zero cases to a large number in a single day.

This change will require:

krivard commented 1 year ago

@dshemetov thoughts on this idea? It came up while Roni and I were talking about the coding of non-numerics.

dshemetov commented 1 year ago

@krivard A few initial thoughts (sorry for the delay):

krivard commented 1 year ago

we would need to do time series processing to zero-fill incidence signals for the dashboard

i think we can actually ignore this; wouldn't just leaving those points out be more consistent with this change?

dshemetov commented 1 year ago

It probably depends on how bad the plots look without gap-fills (and how we're handling it currently). It would be very convenient code-wise to just plot nothing in the gaps.

nmdefries commented 1 week ago

I thought we were essentially reporting JHU data as-is and doing minimal processing. So doing this seems like it would be adding unnecessary post-processing to the data.