Python machine learning library based on Object Oriented design principles; the goal is to allow users to quickly explore data and search for top machine learning algorithm candidates for a given dataset
[x] Resampler (this is quite difficult because there are various spots (e.g. callback) that don't fit into the parallelization scheme e.g. don't pickle)
[ ] ModelStacker _(this is more complicated than it appears (or compared with Aggregator) because of the "blending" of model predictions into the final train_meta data-frame)_
[x] update Jupyter notebook examples
Note: some of the above may be solved through others (e.g. ModelSearcher via Tuner)
I imagine there are significant opportunities to speed up code via parallelization.
http://sebastianraschka.com/Articles/2014_multiprocessing.html http://www.paulbrownmagic.com/blog/python_functional_iteration/
train_meta
data-frame)_Note: some of the above may be solved through others (e.g. ModelSearcher via Tuner)