ContinuumIO / elm

Phase I & part of Phase II of NASA SBIR - Parallel Machine Learning on Satellite Data
http://ensemble-learning-models.readthedocs.io
43 stars 27 forks source link

Wrap GridSearchCV and RandomizedSearchCV for MLDataset (xarray data structures) #223

Open PeterDSteinberg opened 6 years ago

PeterDSteinberg commented 6 years ago

PR #192 added dask-searchcv (which I understand will be dask-ml soon) base classes to Elm - the base classes used in elm.model_selection.EaSearchCV. PR #192 improved the evo algorithm as well as xarray (xarray_filters.MLDataset) support (See also #221 in progress for cross validation and MLDataset issues). It should be easy to just take GridSearchCV and RandomizedSearchCV from dask-ml / dask-searchcv and use the sklearn mixins for xarray support (use EaSearchCV for guidance).

PeterDSteinberg commented 6 years ago

Hi @gbrener I assigned this one to you - seems like a good way to get familiar with newer Elm changes without too big of a file diff needed. This can happen post-AGU if needed.

PeterDSteinberg commented 6 years ago

These classes will need to implement the sampler logic of EaSearchCV (PR #221). Let's make:

Maybe some of what is now elm.model_selection.EaSearchCV can become a base class (inheriting from dask_searchcv.DaskSearchBase) and that base class can be a basis for: