h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.92k stars 2k forks source link

Add bootstrapping functionality #9349

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

Add the ability to bootstrap on an H2O frame so that one could train a model on a sample with replacement and validate on a sample with replacement repeatedly. This would allow users to calculate standard error on performance metrics. A description of this process can be found here: https://machinelearningmastery.com/calculate-bootstrap-confidence-intervals-machine-learning-results-python/

It would be nice to have the ability to not only perform this bootstrapping on metrics automatically calculated by H2O-3, but also be able to do this on custom metrics. I think this could be possible by allowing the user access to the sampled validation frame. They can then apply their custom validation function on the frame.

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-6272 Assignee: New H2O Bugs Reporter: Megan Kurka State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A