jmhernan / crpe_ed_covid_pulse

1 stars 0 forks source link

Gathering the right tables from the data pull #2

Open jmhernan opened 3 years ago

jmhernan commented 3 years ago

There are 8 education tables that we can extract: https://www.census.gov/data/tables/2021/demo/hhp/hhp22.html#tables

We will need to extract and preprocess to produce this as an outcome: Where Subgroup is race/ethnicity and income. Full Question Answer Subgroup Value (weighted percentage) Subgroup Type Category
jmhernan commented 3 years ago

Will need to update this: https://github.com/jmhernan/crpe_ed_covid_pulse/blob/main/scripts/01_generate_weekly_puf_data.R#L12

For the right data variables focused on education.

jmhernan commented 3 years ago

We need to use the data dictionaries in the GShared drive to create the list of columns we need to extract. See this for reference

Once we locate the columns we can develop script to extract + transform.

padma-g commented 3 years ago

I went through all the data dictionaries (through week 24, which is the most current week), and I think we have all the columns (I'm going to create an updated table). I'm going to work on a script to extract the columns.

jmhernan commented 3 years ago

@padma-g thank you!