nychealth / coronavirus-data

This repository contains data on Coronavirus Disease 2019 (COVID-19) in New York City (NYC), from the NYC Department of Health and Mental Hygiene.
https://www1.nyc.gov/site/doh/covid/covid-19-data.page
957 stars 649 forks source link

Vaccine Status Definition and Age-Adjustment Clarifications #235

Closed warrtalon closed 2 years ago

warrtalon commented 2 years ago

Hello, @acharney2 , in this Readme, the definitions for "Vaccinated" and "Unvaccinated" leave room for interpretation.

https://github.com/nychealth/coronavirus-data/blob/master/latest/Readme.md

May we know the following?

Breakthrough Summary.csv: 1) In what category do you place those who are <14 days after the most recent dose completing their primary series?

Now-Weekly-Breakthrough.csv: 1) Does this table include all COVID outcomes for each week, or is there another set of outcomes not seen because they don't fit the "Unvaccinated" or "Vaccinated" definitions? 2) Hosp/Death rates:

My chart here shows what I mean. The yellow line is my calculation for crude rate while the blue line is yours. It makes a huge difference (blue vs. yellow): image

This 2nd version includes my interpolation for weeks 1/8-1/29 since those seem to have incomplete Vax Record Matching: image

These details are important, because you can see how that artificial blue hump dramatically impacts the chart on the website: image

Since the charts are being shared widely on social media by Commissioner Chokshi and very large media accounts before Vax Record Matching is complete, it can lead to public misconceptions and unintended policy impacts. You can see why I care so much about it, and I know you do, too. 🙏

Thank you! -Clayton

dd11235 commented 2 years ago

I was just about to ask about the definition of "Vaccinated" and "Unvaccinated" in the breakthrough data myself so hope you get an answer soon!

warrtalon commented 2 years ago

I was just about to ask about the definition of "Vaccinated" and "Unvaccinated" in the breakthrough data myself so hope you get an answer soon!

Definitions are in that link I gave at the top. Here's a snippet: image

mmontesanonyc commented 2 years ago

Hi, thanks for the interest.

re: breakthrough-summary.csv: we do not have a partially vaccinated category in these data, and partially vaccinated people (those who are <14 days after completing the primary series) are excluded from these data, including the denominators.

re: age-adjustment: You can read more about age-adjustment here. This is a standard analytical process in epidemiology. When two populations have very different age distributions, crude rates primarily reveal age differences, whereas age-adjusted rates separate the effect of the age differences from the effect of the thing you want to study. In this case, the standard and appropriate way to compare these two populations is via age-adjusted rates.

warrtalon commented 2 years ago

Hi, thanks for the interest.

re: breakthrough-summary.csv: we do not have a partially vaccinated category in these data, and partially vaccinated people (those who are <14 days after completing the primary series) are excluded from these data, including the denominators.

re: age-adjustment: You can read more about age-adjustment here. This is a standard analytical process in epidemiology. When two populations have very different age distributions, crude rates primarily reveal age differences, whereas age-adjusted rates separate the effect of the age differences from the effect of the thing you want to study. In this case, the standard and appropriate way to compare these two populations is via age-adjusted rates.

Thank you for the partial vax answer. That makes sense and is good to confirm.

FOr the age-adjustment, I acknowledge awareness of the standard, and I understand why it's usually done. What I'm asking is what exact formula you're using and how it accounts for 75+ having more Unvaxxed than any other age group. This would seemingly cause the age-adjustment to go the other direction. Why would more young people being vaccinated cause age-adjustment to triple the crude rate?

These 2 charts show what I mean: % by Age Group image Raw Count by Age Group image Raw Count with Death Rates (based on Week of 1/8/22) image

May we see the calculations and how this works? If not, it's like a black box that no one in the public can understand.

Thank you!