Open Sean-In-The-Library opened 4 years ago
Not a dumb question at all.
The goals for the lab are (1) to be able to interpret a regression that includes dummies, and (2) to think about which dummy variables you need to omit to test specific hypotheses. So it is a fully interpretive lab. Everything you need is in the regression table already. We worried that including the data might make people think they need to run additional models to answer the questions, which you do NOT.
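To make goal (2) concrete, here is a minimal sketch of how the omitted (reference) category determines which comparisons the dummy coefficients test. The data, the column names `salary` and `degree`, and the group labels are all made up for illustration; they are not taken from the lab's dataset, and none of this is needed to answer the lab questions.

```r
# Hypothetical toy data: column names and values are invented for this sketch.
dat <- data.frame(
  salary = c(50, 52, 54, 60, 62, 64, 75, 77, 79),
  degree = factor(rep(c("BA", "MA", "PhD"), each = 3))
)

# By default lm() omits the first factor level ("BA"), so each dummy
# coefficient is that group's mean salary minus the BA mean.
m1 <- lm(salary ~ degree, data = dat)
coef(m1)

# Omit "MA" instead: the coefficients now compare BA and PhD against MA.
dat$degree2 <- relevel(dat$degree, ref = "MA")
m2 <- lm(salary ~ degree2, data = dat)
coef(m2)
```

With BA omitted, the coefficients test MA vs. BA and PhD vs. BA. A direct test of PhD vs. MA only shows up as a single coefficient once MA is the omitted group, which is the kind of choice the lab asks you to reason about.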
If you want to play with the data, though, it is here:
# synthetic engineer salaries used in the lab
URL <- "https://raw.githubusercontent.com/DS4PS/cpp-523-spr-2020/master/labs/data/engineer-salaries.csv"
dat <- read.csv( URL, stringsAsFactors=FALSE )
It is synthetic data created for the assignment. If you want to play with some real compensation data similar to that used in the chapter:
# compensation of nonprofit executives
URL <- "https://raw.githubusercontent.com/DS4PS/cpp-523-spr-2020/master/labs/data/np-comp-data.csv"
dat <- read.csv( URL, stringsAsFactors=FALSE )
Data Dictionary:
This is great. Thank you for taking the time to post it. I'll give it a try while I'm completing my assignment.
No problem.
Side note: please leave issues open, because once they are closed other students can't access them.
Oops, sorry about that. I thought closing it would mark it as "resolved" so they knew that there was an answer to the question.
No problem!
I know that we don't need the source data for Lab05 to answer the questions, but will we get access to it so that we can try to crunch some of this stuff in R if we want?
Sorry if this is a dumb question (or if the point of the lab is specifically to focus on the formulas and not the data).