govex / COVID-19

Data analysis and visualizations of daily COVID cases report
MIT License
206 stars 155 forks source link

Announcement: change in U.S. vaccine data structure #351

Open sarabertrandelis opened 3 years ago

sarabertrandelis commented 3 years ago

Hello all

Since mid-December, the Coronavirus Resource Center team has been manually collecting U.S. state vaccine data from publicly-reported sources. The data reported have evolved over the past two months, as well as the vaccine types being distributed. To accommodate the changing landscape, we are switching over to an automated data collection. Not all states are reporting the same variables, so the dataset will be a mix of U.S. state-reported and CDC-reported data. Details about the logic/criteria regarding the use of U.S. state dashboard vs. CDC data will be explained in a repository readme file. As a result, there will also be a change in data structure. Soon, we will also add a "people" table to the repository, which will allow us to track doses and people vaccinated separately. The new dataset can be found here. The data dictionary and list of sources can also be found in that repository. We will concurrently update the original dataset, available here, until 2/23/21, to give consumers of the data an opportunity to update their systems. After that date, it will be archived in a different folder.

PGuimarais commented 3 years ago

@sarabertrandelis do we know when we can expect J&J vaccines to start reporting / how can we expect the data structure to change?

Currently on vaxdash.us, we weight "% administered" by dividing total number of doses_admin by 2; with this new vaccine, will there be a separate field tracking this single-dose vaccine, or will it be aggregated with the rest of them?

LouisFrely commented 3 years ago

@sarabertrandelis, thanks a lot for compiling this amazing data!

I would also be interested in knowing how the one-dose vaccines will be taken into account, thanks @PGuimarais for raising this.

Thanks a million!