Possibility to fit models with different datasets

flennerhag / mlens

ML-Ensemble – high performance ensemble learning

MIT License

843 stars 108 forks source link

Yes that's easy to do with column selection. For instance, if you have 3 variables, we can concatenate the three datasets into a [n, 9] dimensional input array. We then create an ensemble that fits a predictor per each set of 3 features:

from sklearn.linear_model import LogisticRegression
from mlens.ensemble import SuperLearner
from mlens.preprocessing import Subset

ens = SuperLearner()
ens.add(estimators={"pipe-1": [LogisticRegression()],
                    "pipe-2": [LogisticRegression()],
                    "pipe-3": [LogisticRegression()]},
        preprocessing={"pipe-1": [Subset([0, 1, 2])],
                       "pipe-2": [Subset([3, 4, 5])],
                       "pipe-3": [Subset([6, 7, 8])]})
ens.add_meta(LogisticRegression())

Hope that helps!

flennerhag / mlens

Possibility to fit models with different datasets #124