Closed ijconlon closed 3 years ago
Looks like they fixed it
Ah, I see that the recent data feed is now updating, but take a look at the archived data set (us-counties.csv). The last date of data in that source is September 29. If the us-counties-recent.csv data source continues to update on a rolling 30-day basis, we'll eventually get to a point where a growing number of days won't appear in either data source.
Seems like a better approach than having a recent rolling dataset would be to have calendar year extracts ("us-counties-
Thanks for creating this issue, and great idea @ijconlon. We have updated the county-level data in the rolling-averages directory to have year-based files available to create a full-pandemic dataset. https://github.com/nytimes/covid-19-data/tree/master/rolling-averages
Fantastic! Glad you liked the idea.
Judging from your avatar, I'm thinking you must be a fellow Tar Heel. I live down in Carrboro. Go Heels! And thanks again!
I saw the recent change to the rolling-average data feed for counties due to size limitations, but the us-counties-recent.csv file has not been updated since September 30. That said, is the idea that this data feed should contain data for the last 30 days, on a rolling basis? If so, I wonder what will happen with the earlier days, since they likely wouldn't end up in that large original us-counties.csv file.
Could a possible workaround here be to just have a second county-level dataset with a start date of September 1, 2021? If people want to report on data earlier than that, they can append the data to the original us-counties.csv file.
Thanks for all you do here--it's really a fantastic resource!