tomquisel / covid19-data

0324 data has many dupes from the day before #12

Open bennybetch opened 4 years ago

bennybetch commented 4 years ago

Some of the data, e.g. New York County, has changed from 0323, but most of it is the same. Any idea why?

by the way, this is super helpful!!

tomquisel commented 4 years ago

Great catch! Unfortunately I missed doing the second data pull on 3/24, so 3/24 is only data from that morning (missing many updates). The truth for 3/24 at the end of the day is somewhere between the 3/24 file and the 3/25 update 1 file. I'm not sure how to recover that. You could linearly interpolate maybe, but I don't love making up the data. Maybe there's another historical source now?

cscollett commented 4 years ago

I'd vote for making a note in the README, and possibly removing the duplicate values that were not updated. That makes it clear the data is absent, and people can either interpolate or just rely on the plotting software to do that work for them.
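
For anyone who wants to try this locally, here is a rough Python sketch of both ideas from the thread: blank out 3/24 values that are identical to 3/23, then optionally fill the gap by linear interpolation. The function names, the dict-per-day layout, and the county counts are all made up for illustration; they are not the repo's actual file format. Note the caveat that a county with genuinely no new cases would also be blanked out.

```python
from typing import Dict, List, Optional


def mask_stale(prev: Dict[str, int], curr: Dict[str, int]) -> Dict[str, Optional[int]]:
    """Return curr with any value unchanged from prev replaced by None.

    Assumption: an unchanged value means "not updated", which is not
    always true (a county may really have had no new cases).
    """
    return {k: (None if prev.get(k) == v else v) for k, v in curr.items()}


def interpolate_gaps(series: List[Optional[float]]) -> List[Optional[float]]:
    """Linearly fill runs of None that sit between two known values.

    Leading and trailing None values are left as None, since there is
    nothing to interpolate from on one side.
    """
    out = list(series)
    last = None  # index of the most recent known value
    for i, v in enumerate(out):
        if v is None:
            continue
        if last is not None and i - last > 1:
            step = (v - out[last]) / (i - last)
            for j in range(last + 1, i):
                out[j] = out[last] + step * (j - last)
        last = i
    return out


# Hypothetical counts, not real data from the repo.
day_0323 = {"New York": 100, "Albany": 10}
day_0324 = {"New York": 150, "Albany": 10}

masked_0324 = mask_stale(day_0323, day_0324)
# Albany is flagged as missing because it did not change.

albany = interpolate_gaps([10, masked_0324["Albany"], 16])
# The middle value is filled halfway between 10 and 16.
```

Keeping the masking step separate from the interpolation step means the published files could carry only the blanks, and each user decides whether to fill them.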