ozanj / rclass


add some school and community characteristics to prospect list data frame #8

Open ozanj opened 6 years ago

ozanj commented 6 years ago

@cyouh95 @mpatricia01

This task is for Crystal, but I am @ing you too Patricia so you are aware of what's going on.

In lecture 3 of my R class (next Friday) we will work a little with the prospect list dataset from Western Washington University.

I put the data and the R code that reads it in this folder of the GitHub repo (path on my PC): C:\Users\ozanj\Documents\rclass\data\prospect_list

The relevant R script is create_prospect_list.R

The problem w/ this dataset is that it is kind of boring, because we don't have any variables that describe characteristics of the schools and the communities.

Task:

Crystal, would you be willing to merge in some community characteristics variables and some school characteristics variables (e.g., race, population, income, etc.)? It doesn't need to be a huge number, just enough to make the dataset more interesting for students to work with, and preferably a mix of variable types (e.g., numeric vs. character, integer/factor vs. double).
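A minimal sketch of the kind of merge meant here, in base R. Every data frame name, column name, and value below is made up for illustration; none of it is taken from create_prospect_list.R or the real prospect list.

```r
# Hypothetical prospect-level data: one row per prospect
prospects <- data.frame(
  prospect_id = 1:4,
  zip_code    = c("98225", "98226", "98101", "99999"),
  stringsAsFactors = FALSE
)

# Hypothetical zip-level community characteristics (invented values),
# deliberately mixing variable types
zip_chars <- data.frame(
  zip_code   = c("98225", "98226", "98101"),          # character
  pop_total  = c(45000L, 38000L, 12000L),             # integer
  med_income = c(48500.5, 61200.0, 75300.25),         # double
  locale     = factor(c("city", "suburb", "city")),   # factor
  stringsAsFactors = FALSE
)

# Left join: keep every prospect; unmatched zip codes get NA
merged <- merge(prospects, zip_chars, by = "zip_code", all.x = TRUE)

str(merged)
```

Using `all.x = TRUE` keeps prospects whose zip code has no match, so students can also see what missing values look like after a merge.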

Is it possible to do this by Monday AM? I'm going to spend Monday revising this lecture, so it would be helpful to have the updated data while I do this.

thank you so much Crystal!!!

cyouh95 commented 6 years ago

Here is the updated data: 73c9851 (pls let me know if this looks okay or if there's anything else needed!)

The new cols are:

But only about half of the rows got matched to HS data due to limitations of the crosswalk (94 matched private HS + 133,571 matched public HS = 133,665 rows matched to HS out of 268,396 rows total, or about 49.8%).
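A quick check of that arithmetic in R, using only the counts quoted in this comment (the variable names are just for illustration):

```r
# Counts reported in this comment
matched_private <- 94
matched_public  <- 133571
total_rows      <- 268396

# Total matched rows and share of all rows
matched_total <- matched_private + matched_public
match_rate    <- matched_total / total_rows

matched_total                # 133665
round(100 * match_rate, 1)   # 49.8 (percent)
total_rows - matched_total   # 134731 rows with no HS match
```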

(And not sure if we need the zip_code/merged_hs cols in final data - maybe those could be removed)
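If those two columns are dropped, a base R sketch could look like the following. Only the column names `zip_code` and `merged_hs` come from this thread; the stand-in data frame, its values, and the treatment of `merged_hs` as a logical match flag are assumptions for illustration.

```r
# Stand-in for the merged prospect data (invented values)
df <- data.frame(
  prospect_id = 1:3,
  zip_code    = c("98225", "98226", "98101"),
  merged_hs   = c(TRUE, FALSE, TRUE),  # assumed to be a match flag
  stringsAsFactors = FALSE
)

# Drop the helper columns that were only needed for the merge
drop_cols <- c("zip_code", "merged_hs")
df_final  <- df[, !(names(df) %in% drop_cols), drop = FALSE]

names(df_final)
```

`drop = FALSE` keeps the result a data frame even when only one column is left.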

cyouh95 commented 6 years ago

@ozanj Here is the updated CSV w/ school_category and total_12 columns. I think other cols that are available for both public and private HS are Title I Status (Yes, No, Missing, NA) and total 9-11 graders, if we need those as well?