h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

Tool for estimating desired H2O cluster size based on input data #12892

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

We would like a mechanism or tool that estimates how large an H2O cluster should be for a given dataset. This could be a wizard that asks which algorithms the user plans to run (AutoML, GBM, ...) and asks for the input dataset or datasets. Based on this input and the environment restrictions (maximum memory per node/container), it would provide guidance on how to configure the cluster.
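
As a rough illustration of the kind of calculation such a tool might perform, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: the function name `estimate_cluster`, the `ALGO_MEMORY_FACTOR` table, and the multiplier values are hypothetical and are not part of H2O. The default factor of 4 follows the commonly cited rule of thumb that cluster memory should be roughly four times the dataset size; the higher AutoML factor is a guess.

```python
import math
from dataclasses import dataclass

# Hypothetical memory multipliers per algorithm (illustrative guesses, not H2O figures).
ALGO_MEMORY_FACTOR = {"glm": 4.0, "gbm": 4.0, "automl": 6.0}
DEFAULT_FACTOR = 4.0  # assumed rule of thumb: about 4x the dataset size


@dataclass
class ClusterEstimate:
    nodes: int
    memory_per_node_gb: float


def estimate_cluster(dataset_gb: float, algo: str, max_memory_per_node_gb: float) -> ClusterEstimate:
    """Suggest a node count and per-node memory for a dataset of the given size."""
    if dataset_gb <= 0 or max_memory_per_node_gb <= 0:
        raise ValueError("dataset size and per-node memory limit must be positive")

    # Total memory the cluster should provide under the assumed multiplier.
    factor = ALGO_MEMORY_FACTOR.get(algo.lower(), DEFAULT_FACTOR)
    total_gb = dataset_gb * factor

    # Use the fewest nodes that respect the per-node (container) memory limit,
    # then spread the required memory evenly across them.
    nodes = max(1, math.ceil(total_gb / max_memory_per_node_gb))
    return ClusterEstimate(nodes=nodes, memory_per_node_gb=round(total_gb / nodes, 1))


if __name__ == "__main__":
    print(estimate_cluster(dataset_gb=10, algo="gbm", max_memory_per_node_gb=16))
```

A real tool would likely need to account for more than raw file size, for example the in-memory (parsed) size of the data, the number of models kept in memory, and cross-validation settings.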

hasithjp commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-6045
Assignee: New H2O Bugs
Reporter: Michal Kurka
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A