GeoDaCenter / covid

COVID Atlas alpha code
https://geodacenter.github.io/covid/
GNU General Public License v3.0

Need Volunteers: State Health Department COVID Case Overview #11

Closed Makosak closed 4 years ago

Makosak commented 4 years ago

Calling volunteers! We need help updating a document to identify how, and at what scale, each state health department is recording confirmed cases, testing (i.e., negative cases), and deaths. This will be essential to help confirm county-level cases on a daily basis, and to identify what will be automated vs. what will require human editors.

https://docs.google.com/spreadsheets/d/1b3ElJC8AnwnYfBupBmoJEZJ8D53YfF719R2AL4ierb0/edit?usp=sharing

bibindvasu commented 4 years ago

There are tested cases in many states, and the data is listed county-wise. But they may not be hospitalized. Do we need to capture the testing data too?

Makosak commented 4 years ago

Yes, if there are tested cases, that would be the (-) cases column. Confirmed cases = (+) cases. If there is a better indicator or way of putting this in the spreadsheet, feel free! Seems like we need to capture the following in all:

Likely only confirmed cases and deaths will be included in all states, but tracking what's available will be super useful for the county validation work, and for future research.

-- Marynia A. Kolak, PhD, MFA, MS Assistant Director of Health Informatics Assistant Instructional Professor in Geographic Information Science Center for Spatial Data Science at the University of Chicago

Makosak commented 4 years ago

Bumping this as a critical need -- searching for more volunteers to log state health department info. We will use the metadata from this doc to begin a hospitalization dataset that will be essential for impacted communities. While only a few states are opting to share hospitalization data now, that may change in the future.

disulfidebond commented 4 years ago

Do you want only Wisconsin or elsewhere too?

To put it another way, I wrote a scraper that can handle non-JS sites, and a mostly-working JS scraper that can grab covid statistics. Would these help?

ryanwyg commented 4 years ago

I have completed the Google Sheet with notes on ICU and negative-case reporting. I have also added a "recovered" column, as some states are reporting that information. This could be helpful in later stages of this project. I will monitor state websites and revise the sheet as their information changes.

Makosak commented 4 years ago

Amazing, thank you guys!! I will leave this open until we have a README filled out where we can link to it directly. Thanks @ryanwyg for your amazing work here.

The long-term interest would be using these sources as our gold standard to validate against, but the huge variety of formats poses an issue.

@disulfidebond would you want to start linking the JS-scraped WI gold standard to check against the counties.geojson file numbers, i.e., link up with the county validation team (@JohnWSteill @SteveGoldstein @linqinyu @Sihan-Mao)? That will be the final part of that important workflow!
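
The check against counties.geojson described above could look roughly like the sketch below. It is only an illustration: the property names `GEOID` and `confirmed_count` are assumptions about the file's schema, `compare_counts` is a hypothetical helper, and the sample counts are made up rather than taken from any state source.

```python
import json


def compare_counts(geojson_text, gold_counts,
                   fips_key="GEOID", cases_key="confirmed_count"):
    """Return {fips: (file_value, gold_value)} for counties that disagree.

    fips_key and cases_key are assumed property names; adjust them to
    whatever the real counties.geojson uses.
    """
    features = json.loads(geojson_text)["features"]
    # Index the file's case counts by county FIPS code.
    file_counts = {
        str(f["properties"][fips_key]): f["properties"].get(cases_key)
        for f in features
    }
    mismatches = {}
    for fips, gold in gold_counts.items():
        found = file_counts.get(fips)  # None when the county is absent
        if found != gold:
            mismatches[fips] = (found, gold)
    return mismatches


# Made-up sample data standing in for counties.geojson and a scraped
# state "gold standard"; the numbers are illustrative only.
sample_geojson = json.dumps({
    "type": "FeatureCollection",
    "features": [
        {"type": "Feature", "geometry": None,
         "properties": {"GEOID": "55025", "confirmed_count": 10}},
        {"type": "Feature", "geometry": None,
         "properties": {"GEOID": "55079", "confirmed_count": 25}},
    ],
})
gold = {"55025": 10, "55079": 27, "55133": 4}

mismatches = compare_counts(sample_geojson, gold)
for fips, (found, expected) in sorted(mismatches.items()):
    print(f"{fips}: file has {found}, state source has {expected}")
```

Keying both sides on the county FIPS code avoids name-matching problems, and a county missing from the file shows up as `None` instead of being silently skipped.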

Makosak commented 4 years ago

Also @disulfidebond -- another option could be scraping the hospitalization numbers for the areas where we can get them, to start a new time-series dataset (with daily updates, similar to the counties.geojson master file). Our health sounding board also identified this as a critical task. If you're interested, feel free to open it as another issue.

RandomFractals commented 4 years ago

@Makosak this dataset has hospitalization and other demographic data you could use for displaying that information: https://www.kaggle.com/jieyingwu/covid19-us-countylevel-summaries

Makosak commented 4 years ago

Closing this issue for now. Thanks, team, for getting this document together!