UBC-MDS / college_scorecard

Discovering how school size impacts soci-economic student metrics. Created by: sedv8808 and smwatts
0 stars 2 forks source link

Added data cleaning & data exploration scripts #4

Closed smwatts closed 5 years ago

smwatts commented 5 years ago

Hi!

Based on our last discussed changes, I edited the data cleaning script to read in data from:

A. Vincenzo's provided dataset (https://github.ubc.ca/MDS-2018-19/DSCI_532_viz-2_students/blob/master/oversized_datasets/scorecard.csv)

B. A subset of the relevant features (e.g. count of students) from Most-Recent-Cohorts-All-Data-Elements.csv (https://ed-public-download.app.cloud.gov/downloads/Most-Recent-Cohorts-All-Data-Elements.csv)

The script then joins and cleans the data so we are able to use it in our app!


I also created a script that will generate the graphs we sketched for our app (they are super interesting!!!)


Have a look through and let me know if you have any comments! This should hopefully get us in a good place to start building out the app :)