Closed glevv closed 1 year ago
Thank you @GLevV for this, some comments: the scaler is fixed (the most common is `StandardScaler`, what you propose is `MinMaxScaler`). Maybe you could include a `scale_X` parameter, which can be `True` or `False`, and by default this would use the `StandardScaler`, which is the most common scaler? The best preprocessing really depends on the dataset, so it should not be fixed in the algorithm. Otherwise LGTM, thanks.
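A minimal sketch of the suggested parameter, assuming a scikit-learn-style transformer (`StumpSampler` is used here only as a placeholder name; the other parameters and the feature-map internals are elided):

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.preprocessing import StandardScaler


class StumpSampler(BaseEstimator, TransformerMixin):
    """Hypothetical sketch: optional internal scaling via a boolean flag."""

    def __init__(self, n_components=100, scale_X=True, random_state=None):
        self.n_components = n_components
        self.scale_X = scale_X  # if True, apply StandardScaler before sampling
        self.random_state = random_state

    def fit(self, X, y=None):
        X = np.asarray(X, dtype=float)
        if self.scale_X:
            self._scaler = StandardScaler().fit(X)
            X = self._scaler.transform(X)
        # ... fit the random stump features on the (possibly scaled) X ...
        return self

    def transform(self, X):
        X = np.asarray(X, dtype=float)
        if self.scale_X:
            X = self._scaler.transform(X)
        # ... map X into the random feature space ...
        return X
```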
They are completely different kernels, with different methods of computing them. The stump kernel was presented in *Support Vector Machinery for Infinite Ensemble Learning*, but it can be hard to compute exactly, so an MC approximation was proposed in *Uniform Approximation of Functions with Random Bases* (the same paper where the MC approximation of the RBF kernel, `RBFSampler`, is described). I think `StumpKernelSampler`/`StumpSampler` would be a shorter and more consistent name (similar to `RBFSampler`).
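For illustration, a minimal sketch of a Monte Carlo feature map built from random decision stumps, in the spirit of `RBFSampler`. The sampling distribution and normalization below are simplified assumptions for the sketch, not the exact construction from either paper:

```python
import numpy as np


def random_stump_features(X, n_components=500, rng=None):
    """Map X (n_samples, n_features) to random decision-stump features.

    Each feature is sign(s * (x[d] - t)) for a random coordinate d,
    a random threshold t drawn from the data range, and a random sign s.
    The inner product of two feature vectors is then a Monte Carlo
    estimate of a stump kernel.
    """
    rng = np.random.default_rng(rng)
    X = np.asarray(X, dtype=float)
    n_features = X.shape[1]
    dims = rng.integers(0, n_features, size=n_components)  # coordinate per stump
    lo, hi = X.min(axis=0), X.max(axis=0)
    thresholds = rng.uniform(lo[dims], hi[dims])           # threshold in data range
    signs = rng.choice([-1.0, 1.0], size=n_components)     # random direction
    Z = np.sign(signs * (X[:, dims] - thresholds))
    return Z / np.sqrt(n_components)                       # so Z @ Z.T estimates the kernel
```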
As for the scaling, I think it is possible to remove it altogether and let users build their own pipelines (stating clearly in the docs that this method requires scaling). That would be consistent with other kernel methods/approximations (`RBFSampler` also requires scaling to give a proper approximation) and with the original formulation in the paper.
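The user-side pipeline would then follow the pattern already standard for kernel approximations in scikit-learn, with the scaler as an explicit step (shown here with `RBFSampler`, since the stump sampler is what this PR would add):

```python
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.kernel_approximation import RBFSampler
from sklearn.linear_model import SGDClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, random_state=0)

# Scaling lives in the pipeline, not inside the kernel approximation.
clf = make_pipeline(
    StandardScaler(),
    RBFSampler(gamma=1.0, n_components=100, random_state=0),
    SGDClassifier(random_state=0),
)
clf.fit(X, y)
```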
Closed due to inactivity
MC approximation of AdaBoost stump kernel #119