Open rbracco opened 4 years ago
I'm currently working on this, using the following sources:
Diabetes is total prevalence (diagnosed and undiagnosed) through 2012, from http://ghdx.healthdata.org/us-data (file: http://ghdx.healthdata.org/sites/default/files/record-attached-files/IHME_USA_COUNTY_DIABETES_PREVALENCE_1999_2012.zip)
Obesity is up to 2011 from here: http://ghdx.healthdata.org/sites/default/files/record-attached-files/IHME_USA_OBESITY_PHYSICAL_ACTIVITY_2001_2011.csv
Smoking is through 2011 from here: http://ghdx.healthdata.org/sites/default/files/record-attached-files/IHME_USA_OBESITY_PHYSICAL_ACTIVITY_2001_2011.csv
Cancer is a 5-year average, taken from here: https://www.statecancerprofiles.cancer.gov/incidencerates/index.php?stateFIPS=00&areatype=county&cancer=001&race=00&sex=0&age=001&year=0&type=incd&sortVariableName=rate&sortOrder=default#results
Hypertension is through 2009, source: http://ghdx.healthdata.org/sites/default/files/record-attached-files/IHME_USA_HYPERTENSION_BY_COUNTY_2001_2009.zip
For each state and county we need:
- Population age and sex statistics
- Comorbidity statistics (smoking, diabetes, hypertension, cancer, and chronic lung disease)
These will be used to predict the hospitalization rate among the infected population in a location. They give more context than case counts alone, since the primary concern is the health care system being overwhelmed and mortality spiking.
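One way the per-county statistics could be combined is by keying every source on the county FIPS code. The sketch below is only an illustration under stated assumptions: the helper names (`normalize_fips`, `merge_by_fips`), the measure names, and the numbers are all hypothetical placeholders, not the actual column names or values from the files linked above. It also assumes each source can be reduced to `(fips, value)` pairs, which may need extra parsing per file.

```python
from collections import defaultdict


def normalize_fips(code):
    """Return a 5-character, zero-padded county FIPS string.

    Sources may store FIPS as an int (1001) or a string ("01001"),
    so both are coerced to the same form before merging.
    """
    return str(int(code)).zfill(5)


def merge_by_fips(tables):
    """Combine {measure_name: [(fips, value), ...]} into one dict per county."""
    merged = defaultdict(dict)
    for measure, rows in tables.items():
        for fips, value in rows:
            merged[normalize_fips(fips)][measure] = value
    return dict(merged)


# Made-up placeholder values, for illustration only (not real data).
tables = {
    "diabetes_pct": [("1001", 11.0), ("06037", 9.5)],
    "hypertension_pct": [(1001, 40.0)],
}
county_records = merge_by_fips(tables)
```

Counties missing from a source simply lack that key in their record, so gaps stay visible rather than being silently filled in.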