scikit-learn-contrib / boruta_py

Python implementations of the Boruta all-relevant feature selection method.
BSD 3-Clause "New" or "Revised" License

No n_features_to_select parameter #92

Open bgalvao opened 3 years ago

bgalvao commented 3 years ago

Although I understand that Boruta is, by design, an all-relevant feature selection method, it would be nice to have the option to select a specified number of features.

As of right now, BorutaPy only reports ranks 1 through 3 (confirmed, tentative, rejected).

I am thinking of looking through the statistical tests and returning a ranking by p-value. If you like this issue and have a clear idea of how to implement it, let me know.
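As a rough sketch of what a p-value ranking could look like: Boruta's statistical test is a one-sided binomial test on how often each feature beats the best shadow feature across iterations. Given those hit counts, you can compute a p-value per feature and sort. The hit counts below are made up for illustration, and this is only a minimal stdlib sketch of the idea, not BorutaPy's actual internals:

```python
from math import comb

def boruta_pvalues(hits, n_trials):
    """One-sided binomial p-value P(X >= hits) under p = 0.5,
    i.e. the chance of beating the best shadow feature this often
    if the feature were irrelevant."""
    return [
        sum(comb(n_trials, k) for k in range(h, n_trials + 1)) / 2 ** n_trials
        for h in hits
    ]

def rank_by_pvalue(hits, n_trials):
    """Rank features 1..n by ascending p-value (1 = most relevant)."""
    pvals = boruta_pvalues(hits, n_trials)
    order = sorted(range(len(hits)), key=lambda i: pvals[i])
    ranks = [0] * len(hits)
    for r, i in enumerate(order, start=1):
        ranks[i] = r
    return ranks, pvals

# Hypothetical hit counts over 20 Boruta iterations:
ranks, pvals = rank_by_pvalue([20, 10, 2], 20)
```

With a continuous ranking like this, an `n_features_to_select` parameter would just take the top-k features by p-value instead of thresholding at confirmed/tentative.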

I am trying to work on it on my fork.

DreHar commented 3 years ago

I know this doesn't directly answer your question. When I want to minimize the number of features, I often run a feature-reduction step after the all-relevant selection: forward or backward stepwise feature elimination, depending on whether you want to keep only a few features or drop only a few, respectively. I have also found that some simulated annealing helps a lot in practice.

This might help in practice because highly correlated features will all get similar p-values, so a pure p-value cut might throw out features that are less statistically significant on their own but carry more orthogonal information.
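The two-stage approach described above can be sketched with scikit-learn's `RFE` (recursive backward elimination) applied to the columns Boruta kept. The `confirmed` index list here is a hypothetical Boruta result on synthetic data, just to show the wiring:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE

# Synthetic data standing in for a real problem.
X, y = make_classification(n_samples=200, n_features=10, n_informative=4,
                           random_state=0)

# Pretend these are the columns Boruta confirmed (hypothetical result).
confirmed = [0, 1, 2, 3, 4, 5]

# Stage 2: backward stepwise elimination down to a fixed budget of 3 features.
rfe = RFE(RandomForestClassifier(n_estimators=50, random_state=0),
          n_features_to_select=3)
rfe.fit(X[:, confirmed], y)

# Map the RFE mask back to the original column indices.
selected = [confirmed[i] for i, keep in enumerate(rfe.support_) if keep]
```

Because the second stage drops features one at a time based on model performance, it tends to keep only one representative from a group of correlated features, which is the orthogonality benefit mentioned above.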

Sorry for the tangent but thought it might help