Project Checkpoint Grade: 9/9
Title and Abstract (0.75/0.75) - Good!
Background (1/1) - Good!
Problem Statement (1/1) - Good!
Data (1.25/1.25) - Good!
Proposed Solution (1.25/1.25) - Some details are missing: how you will split the data into train and test sets, what kind of cross-validation you plan to use given your data, and why. Minor point - fix the formatting of the logistic regression formula. Otherwise good.
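To make the train/test split and cross-validation point concrete, here is a minimal sketch of one reasonable setup, not a prescription. It assumes a binary classification target (suggested by your use of logistic regression) and uses scikit-learn. The synthetic data, the 80/20 split, the 5 stratified folds, and the F1 scoring are all placeholder assumptions; choose values that fit your data and justify them in the write-up.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score, train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Placeholder data (assumption): replace with your cleaned features X and labels y.
X, y = make_classification(
    n_samples=500, n_features=10, weights=[0.7, 0.3], random_state=0
)

# Hold out a test set once; stratify=y keeps the class ratio similar in both parts.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Scaling lives inside the pipeline, so it is refit on each training fold
# and never sees the validation fold.
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Stratified folds preserve the class balance in every fold (assumed choice).
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
cv_scores = cross_val_score(model, X_train, y_train, cv=cv, scoring="f1")
print(f"CV F1: {cv_scores.mean():.3f} +/- {cv_scores.std():.3f}")

# Touch the test set only once, after model choices are final.
model.fit(X_train, y_train)
test_accuracy = model.score(X_test, y_test)
print(f"Held-out accuracy: {test_accuracy:.3f}")
```

Stratification is only one option. If your rows are grouped (for example, several records per person) or ordered in time, a group-aware or time-aware splitter would be the better fit, and that reasoning is exactly the "why" I am asking for.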
Metrics (1.25/1.25) - Good!
Preliminary Results (1.5/1.5) - I would add markdown cells between the cleaning and wrangling steps explaining each step - it's hard for a grader to read through all the lines of code to determine what you are actually doing. I would also add some EDA before jumping into Model Selection - visualizations showing the distributions of your features and target variable, plus a correlation heatmap that includes the target, to give a sense of what your data looks like and how the features correlate with each other and with the target. Good work on Model Selection - add a table at the end comparing all 4 of your models across your evaluation metrics so we get an overall picture of model performance.
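For the model comparison table, something along these lines could work. This is only a sketch: the data is synthetic, and apart from logistic regression the models listed are stand-ins (I am not assuming these are your four), as are the metric names. Substitute your own estimators and the metrics from your Metrics section.

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_validate
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

# Placeholder data (assumption): replace with your training features and labels.
X, y = make_classification(n_samples=300, n_features=8, random_state=0)

# Hypothetical model list: swap in the four models you actually trained.
models = {
    "Logistic regression": LogisticRegression(max_iter=1000),
    "Decision tree": DecisionTreeClassifier(random_state=0),
    "Random forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "k-NN": KNeighborsClassifier(),
}

# Placeholder metrics (assumption): use the ones defined in your Metrics section.
metrics = ["accuracy", "precision", "recall", "f1"]

rows = {}
for name, estimator in models.items():
    # cross_validate scores every metric on the same folds for each model.
    scores = cross_validate(estimator, X, y, cv=5, scoring=metrics)
    rows[name] = {m: scores[f"test_{m}"].mean() for m in metrics}

# One row per model, one column per metric.
comparison = pd.DataFrame.from_dict(rows, orient="index").round(3)
print(comparison)
```

Putting every model through the same folds and the same metrics keeps the rows directly comparable, which is the main thing I want to see in the table.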
Ethics and Privacy (0.5/0.5) - OK. Keep adding to this section as you continue your analysis.
Team Expectations (0.25/0.25)
Timeline (0.25/0.25)
Other Comments - Great work on this checkpoint - I can see that you guys have incorporated feedback from the proposal as well. You're on the right track :)
You can reply to this feedback below. Contact me anytime if you want help improving your project or have any questions at all!