Open sabbatinif opened 3 years ago
What feature selection criteria for regression are of interest?
I have no strong preferences about the criteria. I can suggest something similar to Python scikit-learn's `feature_selection.f_regression`. It consists of a sequential algorithm that iteratively and greedily selects the most relevant features of a dataset. It starts by training a temporary regressor on a single feature (the one most correlated with the output values) and keeps repeating this operation, adding one feature at a time and always picking the one that most increases the temporary regressor's predictive performance. At the end of this process, features are ranked on the basis of their relevance. But any other criterion would be useful for me.
It may be useful to have a feature ranking procedure applicable not only to classification tasks (e.g. `SignalNoiseRatio` and `SumSquaresRatio`, implementing the `FeatureRanking` interface), but also to regression tasks. At the moment the `FeatureRanking` interface only accepts integer target vectors for calculating the feature rank.