UBC-MDS / DSCI_522_Group-308_Used-Cars

This project attempts to build a regression model to predict the price of used cars based on numerous features of each car.
MIT License

Report feedback #38

ksedivyhaley commented 4 years ago

"For each model we performed 5-fold-cross-validated grid search involving a range of most important model-specific hyper-parameters."

Please add a brief explanation of why this method is appropriate.
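To make the request concrete, here is a minimal sketch of what a 5-fold cross-validated grid search does, which may help frame the justification: every candidate hyper-parameter value is scored only on data the model did not see while fitting, and the scores are averaged over the folds. Everything in it is an assumption for illustration (the toy nearest-neighbour model, the synthetic data, and the names `k_fold_indices`, `knn_predict`, `cv_mae`, and `grid_search`); it is not the code or the models used in the report.

```python
import random
import statistics


def k_fold_indices(n, k):
    """Split range(n) into k contiguous folds of near-equal size."""
    sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def knn_predict(train, x, n_neighbors):
    """Toy model: predict y at x as the mean y of the nearest training points."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:n_neighbors]
    return statistics.mean(y for _, y in nearest)


def cv_mae(data, n_neighbors, k=5):
    """Mean absolute error on held-out folds, averaged over the k folds."""
    fold_scores = []
    for fold in k_fold_indices(len(data), k):
        held_out = set(fold)
        train = [p for i, p in enumerate(data) if i not in held_out]
        errors = [
            abs(knn_predict(train, data[i][0], n_neighbors) - data[i][1])
            for i in fold
        ]
        fold_scores.append(statistics.mean(errors))
    return statistics.mean(fold_scores)


def grid_search(data, grid, k=5):
    """Score every candidate value by k-fold CV and return the best one."""
    scores = {value: cv_mae(data, value, k) for value in grid}
    best = min(scores, key=scores.get)
    return best, scores


# Synthetic data (hypothetical): y = 2x plus Gaussian noise.
random.seed(0)
xs = [random.uniform(0, 10) for _ in range(100)]
data = [(x, 2.0 * x + random.gauss(0, 1)) for x in xs]

best, scores = grid_search(data, [1, 3, 5, 9, 15])
print("best n_neighbors:", best)
```

A sentence in the report along these lines (held-out scoring, averaged over folds, so the chosen hyper-parameters are not tuned to one lucky split) would probably be enough.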

"Since training and validating took a lot of resources, we performed it on a gradually increasing subsets of training data."

Did you increase the subset size from one model to the next, or did you run each model several times on the same sequence of increasingly large subsets?

"Since SVM shown the best results from the very beginning, we performed a thorough adaptive grid search on a bigger subset of 200,000 observations (running for 4 hours) resulting in 81.3% accuracy on validation data."

You just showed a table with a 0.813 validation score. That looks like it should be the "very beginning" result, but if so, the bigger subset should have produced a different number.

"Eg., having an RMSE almost twice higher than MAE suggests that there is a good number of observations where the error is big (the more RMSE differs from MAE, the higher is the variance)"

What are the values for RMSE and MAE?
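A small worked example might also help readers follow the RMSE-versus-MAE argument. The sketch below uses made-up error values (not numbers from the report) to show that two sets of residuals with the same MAE can have very different RMSE when the error is concentrated in a few observations.

```python
import math


def mae(errors):
    """Mean absolute error of a list of residuals."""
    return sum(abs(e) for e in errors) / len(errors)


def rmse(errors):
    """Root mean squared error of a list of residuals."""
    return math.sqrt(sum(e * e for e in errors) / len(errors))


# Hypothetical residuals: same MAE, different spread.
uniform_errors = [100.0, -100.0, 100.0, -100.0]  # every prediction off by 100
skewed_errors = [0.0, 0.0, 0.0, 400.0]           # one prediction off by 400

print(mae(uniform_errors), rmse(uniform_errors))  # 100.0 100.0
print(mae(skewed_errors), rmse(skewed_errors))    # 100.0 200.0
```

In the second case RMSE is twice MAE, which is the situation the quoted sentence describes. Reporting the actual two values next to that sentence would let readers check the ratio themselves.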