Open atperzynski opened 6 years ago
Note that SEER apparently now includes some tract level data:
https://seer.cancer.gov/seerstat/databases/census-tract/index.html
There is a project out of California that used some of this data about 10 years ago, but not much since.
For my notes:
Include an ADI agreement table and a map with ADI ratio shading in the README/future vignette.
This is my first draft of specs for how Sociome should work, in outline form.
The goal of the sociome package is to help the user to operationalize social determinants of health (SDOH) data in their research.
Sociome will allow users to access individual variables and indices from the US Census and American Community Survey efficiently, using the Census API and functionality first demonstrated by Kyle Walker in his tidycensus package
Users will be able to operationalize commonly used SDOH variables (e.g. % of population living below federal poverty level) and indices (e.g Singh's Area Deprivation index) with minimal need for data cleaning and processing, simply by executing the appropriate package commands
Sociome will allow users to build tables of data for any state or for the entire United States
Sociome will allow users to create data at the level of the block group, census tract, county or state
Users will be able to create data for any individual year where data is available from the Census, and users can select whether to use decennial census, or American Community Survey 1, 3 or 5 year estimates (with appropriate constraints for data accuracy/availability)
Sociome will allow for imputation of missing values
Sociome will use a data reduction (e.g. factor analysis, principal components) approach for creation of indices.
Creation of indices will allow for use of factor weights at multiple possible reference areas (e.g. nation, state) as well as custom defined reference areas.
Sociome will include an efficient approach to visualizing data in chloropleth maps.
While the default distribution of an index like ADI will utilize the Singh method, users will be able to specify custom transformation to z scores, percentiles and deciles.
Sociome will default to using the 2010 Census area definitions; with other area definitions becoming available in future releases.
Future releases will include the ability to create data sets including the CDC 500 Cities data, available via the Socrata API here: https://dev.socrata.com/foundry/chronicdata.cdc.gov/csmm-fdhi
A future release will include internet access, presence of a computer, and presence of a smartphone for the 2013 to 2016 data when these questions were asked. See https://www.socialexplorer.com/data/ACS2015/metadata/?ds=ACS15&table=B28002
and https://www.socialexplorer.com/data/ACS2016/metadata/?ds=ACS16&table=B28001