sfbrigade / data-covid19-sfbayarea

Manual and automated processes of sourcing data for the stop-covid19-sfbayarea project
MIT License

The San Mateo deaths data v2 is missing #171

Closed yetesh closed 3 years ago

yetesh commented 3 years ago

Describe the bug

The deaths data for San Mateo County in data v2 is missing: https://github.com/sfbrigade/stop-covid19-sfbayarea/blob/master/data/data.v2.json#L16204

To Reproduce

Steps to reproduce the behavior:

  1. Go to the data.v2.json file
  2. Search for "san_mateo" and check the "deaths" array
  3. The deaths array has no elements
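The manual check above can also be scripted. This is only a minimal sketch: the exact layout of data.v2.json is not shown in this issue, so the helper searches for the "san_mateo" and "deaths" keys at any depth rather than assuming a fixed path. The function names `find_key` and `empty_series` are hypothetical, and the sample document is a made-up stand-in for the real file.

```python
import json
from typing import Any, Iterator


def find_key(node: Any, key: str) -> Iterator[Any]:
    """Yield every value stored under `key` at any depth of parsed JSON."""
    if isinstance(node, dict):
        for name, value in node.items():
            if name == key:
                yield value
            else:
                yield from find_key(value, key)
    elif isinstance(node, list):
        for item in node:
            yield from find_key(item, key)


def empty_series(data: Any, county: str, series: str) -> bool:
    """Return True if `county` has no non-empty `series` array.

    A county or series that is absent altogether also counts as empty.
    """
    arrays = [
        found
        for county_node in find_key(data, county)
        for found in find_key(county_node, series)
        if isinstance(found, list)
    ]
    return not any(arrays)


# Made-up stand-in for data.v2.json; the nesting here is an assumption.
sample = json.loads('{"san_mateo": {"series": {"cases": [1], "deaths": []}}}')
print(empty_series(sample, "san_mateo", "deaths"))
```

To run it against the real file, replace `sample` with the result of `json.load` on an open handle to data.v2.json.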

Expected behavior

To move from data v1 to data v2, we need the cases and deaths information for each county.

Screenshots

[Image: Screen Shot 2021-01-06 at 9 27 55 PM]

Mr0grog commented 3 years ago

To add some context here, the scrapers are generally oriented around pulling data from official county sources. San Mateo does not publish a time series of deaths (only the current totals), so we don’t have any data that we can show here.

However, the state does provide county-level deaths alongside cases as a timeseries here: https://data.ca.gov/dataset/covid-19-cases/resource/926fd08f-cc91-4828-af38-bd45de97f8c3

We need to sanity-check that the numbers at least roughly match up to what the county is reporting for current deaths, and if so, consider using it to fill in the deaths timeseries for San Mateo.
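One way that sanity check could look, as a rough sketch only: take the most recent cumulative deaths figure for the county from the state timeseries and compare it against the county's published total, allowing a small relative difference. The row keys (`county`, `date`, `total_deaths`), the 5% tolerance, and both function names are assumptions for illustration; the state dataset's actual column names are not given in this thread, and the numbers below are placeholders, not real data.

```python
from typing import Iterable, Mapping


def latest_cumulative_deaths(rows: Iterable[Mapping[str, str]], county: str) -> int:
    """Return cumulative deaths from the most recent row for `county`."""
    county_rows = [row for row in rows if row["county"] == county]
    if not county_rows:
        raise ValueError(f"no rows found for county {county!r}")
    # ISO 8601 dates (YYYY-MM-DD) sort correctly as plain strings.
    latest = max(county_rows, key=lambda row: row["date"])
    return int(latest["total_deaths"])


def roughly_matches(state_total: int, county_total: int, tolerance: float = 0.05) -> bool:
    """True if the two totals differ by at most `tolerance` (relative)."""
    if county_total == 0:
        return state_total == 0
    return abs(state_total - county_total) / county_total <= tolerance


# Placeholder rows with hypothetical keys and invented values.
state_rows = [
    {"county": "San Mateo", "date": "2021-01-04", "total_deaths": "10"},
    {"county": "San Mateo", "date": "2021-01-05", "total_deaths": "12"},
    {"county": "Alameda", "date": "2021-01-05", "total_deaths": "30"},
]
county_reported_total = 12  # placeholder for the county's published figure

state_total = latest_cumulative_deaths(state_rows, "San Mateo")
print(roughly_matches(state_total, county_reported_total))
```

A relative tolerance is used here because the state and county may update on different schedules, so an exact match on any given day is not guaranteed.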

Mr0grog commented 3 years ago

Also just noticed this is in the wrong repo. Moving to https://github.com/sfbrigade/data-covid19-sfbayarea.

Mr0grog commented 3 years ago

> We need to sanity-check that the numbers at least roughly match up to what the county is reporting for current deaths, and if so, consider using it to fill in the deaths timeseries for San Mateo.

Checked this last night, and it is reasonable/feasible. The totals match up.