wdoyle42 / cc_earn

ML approach to predicting earnings of CC completers
MIT License
1 stars 0 forks source link

County Level Predictors #1

Open wdoyle42 opened 2 years ago

wdoyle42 commented 2 years ago

List of predictors from ACS at county level:

-Median Income -Labor force participation -Education levels -Housing

Small Areas estimates from BLS

-Occupation/Industry employment -Business activity

btskinner commented 2 years ago

@onmorales23, could you take first stab at putting together a data set with these variables?

Ideally, there will be a script (get_county_data.R) that uses packages to interact with ACS and BLS APIs, pulls the data and produces a data set that looks like

County ID (FIPS) Median income ... Employment
32605 <...> ... <...>

For right now, let's say the data should come from the year of college graduation (so if the scorecard data are for earnings in 2020 for graduates one year out, then values for 2019). If that's a problem, we can figure out the best period.

Post questions here or come talk to me, particularly if you are unsure what variables to pull.

btskinner commented 2 years ago

@onmorales23, from our discussion today, can you move this work to a new branch? Maybe dev_county_predictors or something like that?