Closed lucasmbrown-usds closed 3 years ago
The UI will look something like this for Urban/Rural
Look at https://github.com/usds/justice40-tool/pull/660 to see a similar example of how to incorporate into the ETL scripts.
The actual CSV you get is attached here
Basically the data set is unique at the
census block group, Urban/Rural indicator
level. A census block group might show up twice if it has both urban and rural components. If the census block group has both urban and rural components, it will show up as two rows, one for "U" and one for "R" with the weighting given by population.
The query string should look something like this: https://github.com/usds/justice40-tool/blob/47df35b77e840b2b2e303853db751d58c355d874/data/data-pipeline/data_pipeline/etl/sources/census_acs_median_income/etl.py#L178
I've put the data set in my local laptop as:
/c/github/justice40-tool/data/data-pipeline/data_pipeline/data/geocorr
just to play around with it.
Slack thread on running code: https://usds.slack.com/archives/C0222FBGQ65/p1631808402174600
Description We currently have scores that rank order each community (Census Block Groups). One can map each community to either Rural or Urban. We want to know what the distribution of the rank ordering is across urban and rural communities. Some starting questions might be:
Note, this literally applies to scores that generate a classification (1 or 0) of disadvantaged, which admittedly are most of the scores. For score that produce a rank ordering, will need a different definition, like mean or some other way to measure the distribution.
Links to user research or other resources
Tasks
Definition of "Done"
Relevant Links