nytimes / covid-19-data

A repository of data on coronavirus cases and deaths in the U.S.
https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html
Other
6.99k stars 3.46k forks source link

Is the data on sex and age available anywhere? #11

Closed macrophyte closed 1 year ago

Mantissa-23 commented 4 years ago

Also interested in this.

CBG-63 commented 4 years ago

This will likely not be available, as it may violate HIPAA laws. While some of these have been temporarily waived for doctors to treat patients in other states, etc., it likely doesn't apply to the public.

girlinamovie commented 4 years ago

Agree with CBG-63

sareeneng commented 4 years ago

HIPAA identifiers - https://www.luc.edu/its/aboutits/itspoliciesguidelines/hipaainformation/18hipaaidentifiers/

Sex is not an identifier Exact age is not an identifier if <= 89 years old. These patients could be all binned together as >89. It cannot be linked to county-level data, but can be linked to state-level data.

Patient-level data is highly important. Right now we have no ways of risk-stratification (speaking as a PICU physician in New York), and all the publicly available data is focused towards epidemiology which doesn't help us take care of patients that are currently infected in the hospital.

thoughtafter commented 4 years ago

No one is releasing individual case data as far as I am aware. The issue for age is that states are binning ages differently so it's not very usable data.

wesg52 commented 4 years ago

It would be worth joining this with census data by county to identify high risk clusters i.e. those counties with higher than average elderly populations.

timriffe commented 4 years ago

No one is releasing individual case data as far as I am aware. The issue for age is that states are binning ages differently so it's not very usable data.

Can you give some pointers to states publishing cases and deaths in binned ages?

thoughtafter commented 4 years ago

No one is releasing individual case data as far as I am aware. The issue for age is that states are binning ages differently so it's not very usable data.

Can you give some pointers to states publishing cases and deaths in binned ages?

OR (under "Demographic and Hospital Information"): https://govstatus.egov.com/OR-OHA-COVID-19 WA: https://www.doh.wa.gov/Emergencies/Coronavirus

Joe-Wasserman commented 4 years ago

There are line lists of individual cases with demographic information (as much as available) being compiled here: https://github.com/beoutbreakprepared/nCoV2019

paoist commented 4 years ago

OR: https://govstatus.egov.com/OR-OHA-COVID-19 WA: https://www.doh.wa.gov/Emergencies/Coronavirus

Interesting, both have more female than male deaths.

Seems important that aggregated gender statistics are made available nation wide.

amcdavid commented 4 years ago

I am also very interested in finding age-stratified data, especially of deaths. It seems almost negligent to compare incidence of death between time or place without this. The idea of joining to census data is interesting, and would be most appropriate if infection is independent of age. We know this isn't true on a micro level because nursing homes, etc, result in inhomogeneity. Any other ideas?

paoist commented 4 years ago

This link is a fairly updated and comprehensive with source references for most countries but since it’s in most cases are just aggregated data it’s far from as useful as it could be. Some of the official sources listed provide time series in the native languages but only aggregated data on age, gender and preconditions which is frustrating. https://www.worldometers.info/coronavirus/#countries https://www.worldometers.info/coronavirus/#countries One can only hope the WHO or someone in a centralised place get better data but if that’s the case they really ought to share much more publicly.

On 29 Mar 2020, at 03:56, Andrew McDavid notifications@github.com wrote:

I am also very interested in finding age-stratified data, especially of deaths. It seems almost negligent to compare incidence of death between time or place without this. The idea of joining to census data is interesting, and would be most appropriate if infection is independent of age. We know this isn't true on a micro level because nursing homes, etc, result in inhomogeneity. Any other ideas?

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/nytimes/covid-19-data/issues/11#issuecomment-605545547, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAIU3IWSJFCTBQE323QSHYLRJ2TEFANCNFSM4LVD3M2Q.

Otteholt commented 4 years ago

Thoughtafter states

No one is releasing individual case data as far as I am aware. The issue for age is that states are binning ages differently so it's not very usable data.

As a workaround to this issue it seems like the average age of a person’s assigned bin could be determined and then utilized. These values could be obtained for patients in all states.

Detailed age resolution would be lost, but the bins used in each state are similar enough to provide valuable insights.

Could this be provided NYT?

Dawnteach commented 4 years ago

No description provided.

Hi, not sure if this will help but Georgia is reporting actual age data, sex and even underlying conditions in general. Maybe that will clear the way for you to do the same in New York. Thinking of everyone since I have a daughter who lives in Manhattan!

https://dph.georgia.gov/covid-19-daily-status-report

timriffe commented 4 years ago

Santa Clara county gives a dashboard with 10-year age groups for cases and deaths: https://www.sccgov.org/sites/phd/DiseaseInformation/novel-coronavirus/Pages/dashboard.aspx

shoffer commented 4 years ago

It always fascinated me when people toss around a law or regulation like HIPAA or PIA and do not ever bother to read or understand it. Thankfully, some of you do and correct for this.

The data is worthless without certain attributes.

Age and underlying conditions are critical to understanding what is going on. Without at least age, the reports are a meaningless list that does nothing but allow for panic or complacency. It fosters the extremes. Here are some other attributes that are critical. Duration of illness Severity of symptoms List of symptoms Recovery Recurrence number( per person)

Most reports are generalities that provide no value...just a map of giant red like blood as if the world is ending.

Glad some areas are being more transparent about it such as the references Santa Clara( I have not validated.

Rant over. Stay well everyone.

timriffe commented 4 years ago

Ohio now posting data with sex, age groups, and even some other variables, very nice. https://coronavirus.ohio.gov/wps/portal/gov/covid-19/home/dashboard

brendantodaro commented 4 years ago

I compiled some of the available state-level data (FL, GA, MI, NYC, OH, OR, WA) on death rates by age here: https://github.com/brendantodaro/US-COVID-deaths-by-age/blob/master/README.md

shoffer commented 4 years ago

Thanks for this information.

Steve

On Apr 2, 2020, at 1:03 PM, Tim Riffe notifications@github.com wrote:

Ohio now posting data with sex, age groups, and even some other variables, very nice. https://coronavirus.ohio.gov/wps/portal/gov/covid-19/home/dashboard

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.

shoffer commented 4 years ago

Well done! Will be looking at this later today.

Steve

On Apr 5, 2020, at 1:03 PM, Brendan Todaro notifications@github.com wrote:

I compiled some of the available state-level data (FL, GA, MI, NYC, OH, OR, WA) on death rates by age here: https://github.com/brendantodaro/US-COVID-deaths-by-age/blob/master/README.md

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe.

shoffer commented 4 years ago

Just another question: Has anyone done any side-by-side comparison of the details fo cases and deaths, with comprable demographic and associated data for SARS, Swine Flu, Bird Flu or any Seasonal Flu? Without context and comparison, it is hard to see/make judgement on where this pandemic lines up in contrast to others