usds / justice40-tool

A tool to identify disadvantaged communities due to environmental, socioeconomic and health burdens
https://screeningtool.geoplatform.gov/
Creative Commons Zero v1.0 Universal

Statistical analysis of characteristics of census tracts included in one measure (any of the above) not included in ours. What are the differences between the census tracts in our tool but not in others, and vice versa? #244

Closed. BethMattern closed this issue 2 years ago

BethMattern commented 3 years ago

Problem statement/question

Based on https://github.com/usds/justice40-tool/issues/245 and https://github.com/usds/justice40-tool/issues/135, it would be interesting to run a statistical analysis of the differences between census tracts that are included as priority communities in a comparison metric but not in the current CEJST score. E.g., "Census tracts that are included in CalEnviroScreen disadvantaged communities but are not included in the current CEJST priority communities have 20% higher incomes and 34% less linguistic isolation on average than CalEnviroScreen census tracts that are included in the current CEJST priority communities."
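
A minimal sketch of the comparison described above, assuming each tract record carries two membership flags and the characteristics of interest. The field names (`in_comparison`, `in_cejst`, `median_income`, `linguistic_isolation_pct`), the tract IDs, and all values are hypothetical and only illustrate the calculation; this is not the comparison tool's code.

```python
from statistics import mean

# Hypothetical tract records; every value below is made up for illustration.
tracts = [
    {"geoid": "A", "in_comparison": True, "in_cejst": True,
     "median_income": 40000, "linguistic_isolation_pct": 20},
    {"geoid": "B", "in_comparison": True, "in_cejst": True,
     "median_income": 60000, "linguistic_isolation_pct": 30},
    {"geoid": "C", "in_comparison": True, "in_cejst": False,
     "median_income": 55000, "linguistic_isolation_pct": 10},
    {"geoid": "D", "in_comparison": True, "in_cejst": False,
     "median_income": 65000, "linguistic_isolation_pct": 20},
    # In CEJST only, so it belongs to neither group compared here.
    {"geoid": "E", "in_comparison": False, "in_cejst": True,
     "median_income": 30000, "linguistic_isolation_pct": 40},
]


def group_means(rows, fields):
    """Mean of each field across the given tract records."""
    return {f: mean(r[f] for r in rows) for f in fields}


def compare_groups(rows, fields):
    """Percent difference in group means: tracts in the comparison metric
    but not in CEJST, relative to tracts in both."""
    only_comparison = [r for r in rows if r["in_comparison"] and not r["in_cejst"]]
    both = [r for r in rows if r["in_comparison"] and r["in_cejst"]]
    if not only_comparison or not both:
        raise ValueError("both groups need at least one tract")
    m_only = group_means(only_comparison, fields)
    m_both = group_means(both, fields)
    return {f: 100 * (m_only[f] - m_both[f]) / m_both[f] for f in fields}


result = compare_groups(tracts, ["median_income", "linguistic_isolation_pct"])
for field, pct in result.items():
    print(f"{field}: {pct:+.1f}% vs. tracts in both")
```

Swapping the two membership conditions gives the reverse comparison (tracts in CEJST but not in the comparison metric). A real analysis would probably also want population weighting and a significance test, neither of which is shown here.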

saran-ahluwalia commented 2 years ago

Just adding this as a note. From the CalEnviroScreen methodology (verbatim):

For each census tract, the data was analyzed to estimate the number of households with household incomes less than 80% of the county median and renter or homeowner costs that exceed 50% of household income. The percent of the total households in each tract that are both low-income and housing-burdened was then calculated.

  1. RSE less than 50 (meaning the SE was less than half of the estimate) OR
  2. SE was less than the mean SE of all California census tract estimates for housing burdened low income households.
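
For reference, the two criteria above can be sketched as a reliability screen. This assumes that an estimate meeting either criterion is kept and that anything else is set to null; the excerpt does not include the sentence introducing the list, so that reading is an assumption. The function names and sample numbers are hypothetical.

```python
from statistics import mean


def relative_standard_error(estimate, se):
    """RSE as a percentage of the estimate; None when the estimate is zero."""
    if estimate == 0:
        return None
    return 100 * se / abs(estimate)


def reliable_flags(estimates):
    """estimates: list of (estimate, standard_error) pairs, one per tract.

    A tract passes if its RSE is below 50 OR its SE is below the mean SE
    across all supplied tracts.
    """
    mean_se = mean(se for _, se in estimates)
    flags = []
    for estimate, se in estimates:
        rse = relative_standard_error(estimate, se)
        flags.append((rse is not None and rse < 50) or se < mean_se)
    return flags


def mask_unreliable(estimates):
    """Replace estimates that fail both criteria with None (assumed handling)."""
    return [
        estimate if ok else None
        for (estimate, _), ok in zip(estimates, reliable_flags(estimates))
    ]


# Made-up (estimate, SE) pairs for four tracts; the mean SE is 7.0.
sample = [(30.0, 5.0), (10.0, 8.0), (4.0, 3.0), (20.0, 12.0)]
print(reliable_flags(sample))   # [True, False, True, False]
print(mask_unreliable(sample))  # [30.0, None, 4.0, None]
```

The third tract shows why the second criterion matters: its RSE is 75, but its SE is small in absolute terms, so it still passes.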

lucasmbrown-usds commented 2 years ago

This ticket is already completed, or at least a basic version is: see census_tracts_score_comparisons in the comparison tool.

I'd propose we close this for now – please re-open if it feels premature.

@saran-ahluwalia let me know if I'm misunderstanding, but I'm not sure the comments you've pasted above are relevant to this topic? They're definitely relevant to conversations about handling null values, etc, but not this specific ticket.

saran-ahluwalia commented 2 years ago

@lucasmbrown-usds Yes, that was just a note for myself and posterity outlining CalEnviroScreen's method for handling tracts with null values. That was all. It can serve as a reference for future issues.