nytimes / covid-19-data

A repository of data on coronavirus cases and deaths in the U.S.
https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html
Other
6.99k stars 3.46k forks source link

Data Issue: Validate new cases for Massachusetts Nov 20-22 #648

Closed MehraSakshi closed 2 years ago

MehraSakshi commented 2 years ago

We have searched for the new cases for Massachusetts, showing 7141 for 22nd November and 0 for 20th and 21st November, 2021.

Same data can be seen on NYT (https://www.nytimes.com/interactive/2021/us/massachusetts-covid-cases.html) and CDC(https://covid.cdc.gov/covid-data-tracker/#trends_dailycases).

But the data is different on Gov Website (https://www.mass.gov/info-details/covid-19-response-reporting#covid-19-interactive-data-dashboard-) and Worldometer (https://www.worldometers.info/coronavirus/usa/massachusetts/).

Github source: https://raw.githubusercontent.com/nytimes/covid-19-data/master/us-states.csv

Can you please check and confirm if the stats for Massachusetts are correct on github?

tiffehr commented 2 years ago

Our source for Massachusetts is their Tableau dashboard. We check the page every hour, if not more.

Today's numbers, for reference image

Our figures are always cumulative totals that the state reports for that given day, and if the numbers don't change day over day, we show zero. Our raw data collection reported:

We saw no new increases between those dates, which means the state did not update their state-level figure. They may have retroactively reported new confirmed/probable cases on those dates in breakout charts, but did not do so on the day we collected figures. Our README has a lot of information, but includes that "All cases and deaths are counted on the date they are first announced." But if the state/county doesn't announce a change, we don't retroactively try to find their figures unless it's a serious lull. This is why our own display focuses on rolling averages, to smooth out days with that provided.

tiffehr commented 2 years ago

Colleagues add that MA's day-over-day cases charts are by test date vs. date announced. So that's going to be challenging to sum their figures in those charts to their own cumulative number for the state overall.