h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.78k stars 1.99k forks source link

AutoML modes #7583

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

New parameter for AutoML: {{mode}}

Motivation: We need to be able to compete more strongly in competitions, however some of the techniques we’d do when competing on the basis of model accuracy, are not what we normally do when we first approach a problem and/or build models for production. So we offer a new parameter to switch between modes, where the default is the current AutoML algorithm. This is the “presets” idea we have been planning for a while.

Initial options: {{["explore, "compete"]}}

We plan to add one or more other options in the future, but for right now, this will suffice.

Technical notes:

In compete mode, the extra stacked ensembles can use model ids that get indexed like the other algos. We have some options:

Related tasks:

h2o-ops commented 1 year ago

JIRA Issue Details

Jira Issue: PUBDEV-8066 Assignee: Sebastien Poirier Reporter: Erin LeDell State: Open Fix Version: Backlog Attachments: N/A Development PRs: Available

h2o-ops commented 1 year ago

Linked PRs from JIRA

https://github.com/h2oai/h2o-3/pull/5394