Open marcdotson opened 3 years ago
A recommended reference from @adam-n-smith.
Let's get clear on the sources of data:
(Vehicle 1 Make to Vehicle 8 Year).
(REC_Q1).
@cwjohnson1 and @z-wix, let's do some exploratory data analysis of the ownership and recontact survey. Use the model-validation branch and the 02_exploratory-data-analysis.R script.
I just created two pull requests for some of the code I've been working on. Let me know your thoughts and questions, as well as what the plan is for proceeding. Thanks!
I know we mentioned wanting to compare how participants said they would purchase vs. how they actually purchased. I can do some analyses on that next. Are there any other thoughts?
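One way to start on the stated-versus-actual comparison is a simple agreement rate. The following is only a minimal sketch in Python using the standard library: the respondent IDs, brand values, and the function name `agreement_rate` are all hypothetical, and it assumes each respondent has a single stated choice and a single observed vehicle.

```python
def agreement_rate(stated, actual):
    """Share of respondents whose stated choice equals their observed one.

    Only respondents present in both dictionaries are compared.
    Returns None when there is no overlap.
    """
    shared = stated.keys() & actual.keys()
    if not shared:
        return None
    hits = sum(stated[r] == actual[r] for r in shared)
    return hits / len(shared)


# Hypothetical data: respondent ID -> brand.
stated = {101: "Toyota", 102: "Ford", 103: "Honda", 104: "Subaru"}
actual = {101: "Toyota", 102: "Chevrolet", 103: "Honda", 105: "Ford"}

rate = agreement_rate(stated, actual)
print(rate)
```

Respondents who appear in only one source (104 and 105 here) are left out of the denominator rather than counted as mismatches, which is a design choice worth revisiting once the matching rules are settled.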
@cwjohnson1 no need to create a separate branch -- please just work in model-validation. I've merged your changes back into this branch. I'm digging into the changes now and will provide updates in our weekly meeting.
Here's a sketch of how to use the ownership and recontact survey data as a validation task.
There are a lot of things to figure out in here in terms of matching respondents to their in-sample and hold-out data, recoding open-ends and checking for spelling mistakes, and conditioning just on the brand and year attributes.
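The matching and recoding steps mentioned above could be prototyped along these lines. This is a hedged sketch in Python with the standard library only, not the project's actual code: the brand list `KNOWN_MAKES`, the tuple layout of `ownership`, the 0.8 similarity cutoff, and both function names are assumptions made for illustration.

```python
import difflib

# Hypothetical brand list; the study's actual attribute levels may differ.
KNOWN_MAKES = ["Toyota", "Honda", "Ford", "Chevrolet", "Subaru"]


def recode_make(raw, known=KNOWN_MAKES, cutoff=0.8):
    """Map a free-text make to a known brand, or None if nothing is close."""
    cleaned = raw.strip().title()
    matches = difflib.get_close_matches(cleaned, known, n=1, cutoff=cutoff)
    return matches[0] if matches else None


def build_validation_rows(ownership, in_sample_ids):
    """Keep respondents found in the in-sample data with a usable make and year."""
    rows = []
    for resp_id, raw_make, raw_year in ownership:
        make = recode_make(raw_make)
        year = raw_year.strip()
        if resp_id in in_sample_ids and make is not None and year.isdigit():
            rows.append({"resp_id": resp_id, "make": make, "year": int(year)})
    return rows


# Hypothetical open-ended responses: (respondent ID, make, year).
ownership = [
    (101, "toyota ", "2015"),    # stray whitespace and casing
    (102, "Chevorlet", "2012"),  # misspelling
    (103, "Subaro", "2018"),     # misspelling
    (104, "n/a", "2010"),        # no usable make
    (105, "Ford", "unknown"),    # no usable year
    (999, "Subaru", "2016"),     # not in the in-sample data
]
in_sample_ids = {101, 102, 103, 104, 105}

rows = build_validation_rows(ownership, in_sample_ids)
for row in rows:
    print(row)
```

Conditioning on brand and year only means every other open-ended field is ignored here. Fuzzy matches near the cutoff should still be reviewed by hand, since a loose cutoff can map an unrelated answer onto a brand.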
I just realized that I committed the plots but never pushed them. Sorry about that. You should be able to find them in Sawtooth-2021.Rmd now.
@cwjohnson1 please don't create new branches. You can add this all to model-validation.
Notes on computing predictive fit using the validation task:
I just uploaded two new plots to Sawtooth-2021.Rmd and am working on some more. I know Zach was working with the recontact data, but since he's working on another project now, I can also plot some visualizations for those data if you'd like.
Please do, @cwjohnson1.
Questions about constructing a validation task from recontact/ownership data:
Short-term options:
Long-term options:
I just added the code for the ownership data visualizations, like we talked about, to the presentation folder under the model-validation branch. The code for the recontact visualizations is in 02_exploratory-data-analysis.R. Would you like me to add that code as well so it's easier to find?
No, that's fine. Thanks!
Note that the validation data is car ownership data, not car purchase data.