data-doctors / kaggle-house-prices-advanced-regression-techniques

Repository for the source code of the Kaggle competition House Prices: Advanced Regression Techniques

Try the 1.13.4.3. Tree-based feature selection and also the Pipeline method #14

Open divivoma opened 7 years ago

divivoma commented 7 years ago

http://scikit-learn.org/stable/modules/feature_selection.html

Tree-based estimators (see the sklearn.tree module and forest of trees in the sklearn.ensemble module) can be used to compute feature importances, which in turn can be used to discard irrelevant features (when coupled with the sklearn.feature_selection.SelectFromModel meta-transformer):
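
A minimal sketch of what this suggestion could look like, assuming synthetic data from `make_regression` as a stand-in for the house-price features (this is not the repository's actual preprocessing, and the choice of `ExtraTreesRegressor` and `RandomForestRegressor` is only illustrative). It first applies `SelectFromModel` on its own, then places the same selector in a `Pipeline` ahead of a final regressor:

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import ExtraTreesRegressor, RandomForestRegressor
from sklearn.feature_selection import SelectFromModel
from sklearn.pipeline import Pipeline

# Assumption: synthetic regression data standing in for the Kaggle features.
X, y = make_regression(
    n_samples=200, n_features=20, n_informative=5, noise=10.0, random_state=0
)

# Tree-based feature selection: fit a forest, then keep the features whose
# importance is at least the mean importance (the default threshold for
# estimators that expose feature_importances_).
selector = SelectFromModel(ExtraTreesRegressor(n_estimators=100, random_state=0))
X_selected = selector.fit_transform(X, y)
print(X.shape, "->", X_selected.shape)

# The same selection step chained with a final model in a Pipeline, so the
# selector is refit on whatever data the pipeline is fit on.
pipeline = Pipeline([
    ("select", SelectFromModel(ExtraTreesRegressor(n_estimators=100, random_state=0))),
    ("model", RandomForestRegressor(n_estimators=100, random_state=0)),
])
pipeline.fit(X, y)

# Number of features the pipeline's selection step kept.
n_kept = int(pipeline.named_steps["select"].get_support().sum())
predictions = pipeline.predict(X[:3])
```

Wrapping the selector in a `Pipeline` means cross-validation refits the selection on each training fold, which avoids leaking information from the validation fold into the feature choice.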