Support cross_val_predict for stacked estimators

scikit-learn-contrib / skdag

A more flexible alternative to scikit-learn Pipelines

MIT License

30 stars 8 forks source link

First -- this library looks great, pretty much exactly what I was looking for!

It appears though, that each dependent estimator is trained on the entire passed dataset directly. This can lead to overfitting. Instead, it would be nice if cross_val_predict was supported in some fashion so when we have estimators A -> B, B is trained on output of cross_val_predict rather than raw predict output.

Given activity in this repo, I'm assuming its basically inactive. But if not, I'd be happy to contribute such functionality.

scikit-learn-contrib / skdag

Support cross_val_predict for stacked estimators #27