Closed ksewalk closed 1 year ago
Since one data set includes state-level data and the other does not, using the day zero schema might make the most sense.
We don't have timeseries for state-level right? In that case, having the aggregate counts is all we can provide; the day zero schema is meant for a line list structure, requiring a confirmation date at least, which does not apply here.
Sure, why not use it anyway? It gets us closer to the turnkey system, gives us the option to pivot back to using the line list if we wanted, and we can generate the aggregate counts using it. Do we need a confirmation date? Can we either leave that field blank or fill it with whatever CDC and WHO provide us? The user would not see a line list, they would just see counts, but we would use the line list behind the scenes.
While we do not have a time series file for state-level case data, we will be able to provide one once we get data into the system. We could use the confirmation date value for it.
We need to regularly update G.h's data set to include WHO and CDC case counts. For CDC, this is US/state-level case counts; for WHO it is all other countries.