Imbalanced-learn 1.X - Githubissues

scikit-learn-contrib / imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

MIT License

I agree with that hierarchy. Since the literature mostly divides the methods into data-level and algorithm-level approaches, samplers and predictors make total sense. There are also methods that tackle the problem by modifying the feature space. We could add those to the preprocessing module once we have such an implementation.

I believe that we should always import from the second level, like this:

from imblearn.predictors import BalancedRandomForest
from imblearn.samplers import RandomUnderSampler

One option could be to get rid of the different base classes and rely on estimator tags. That might give us the freedom to make changes more efficiently.
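
To make the tag idea a bit more concrete, here is a rough sketch of what I have in mind. It is pure Python and every name in it (TaggedEstimator, extra_tags, get_tags, is_sampler, is_predictor, and the two toy estimators) is hypothetical, not existing imbalanced-learn or scikit-learn API. The point is only that calling code asks an estimator for its tags instead of checking which base class it inherits from.

```python
from collections import Counter

# Hypothetical sketch: none of these names are imbalanced-learn or scikit-learn API.
DEFAULT_TAGS = {"sampler": False, "predictor": False}


class TaggedEstimator:
    """Single shared base class; the role of an estimator is declared via tags."""

    def extra_tags(self):
        # Subclasses override this to declare what they are.
        return {}

    def get_tags(self):
        tags = dict(DEFAULT_TAGS)
        tags.update(self.extra_tags())
        return tags


class ToyUnderSampler(TaggedEstimator):
    """Keeps the first n rows of each class, n being the minority class size."""

    def extra_tags(self):
        return {"sampler": True}

    def fit_resample(self, X, y):
        n_min = min(Counter(y).values())
        kept = Counter()
        X_res, y_res = [], []
        for row, label in zip(X, y):
            if kept[label] < n_min:
                kept[label] += 1
                X_res.append(row)
                y_res.append(label)
        return X_res, y_res


class ToyMajorityPredictor(TaggedEstimator):
    """Always predicts the most frequent label seen during fit."""

    def extra_tags(self):
        return {"predictor": True}

    def fit(self, X, y):
        self.majority_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.majority_ for _ in X]


def is_sampler(estimator):
    # Dispatch on the declared tag, not on isinstance(...).
    return estimator.get_tags()["sampler"]


def is_predictor(estimator):
    return estimator.get_tags()["predictor"]


X = [[0], [1], [2], [3]]
y = [0, 0, 0, 1]
X_res, y_res = ToyUnderSampler().fit_resample(X, y)
```

With something like this, a pipeline or a common test suite could call is_sampler(est) and never needs to know about a sampler-specific base class, so base classes could be merged or renamed without touching that calling code.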

Imbalanced-learn 1.X #645