H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Even though RF doesn't have a "distribution" param, we are setting it automatically in AutoML. However, we are setting both binomial and multinomial as "multinomial". We should add an extra if statement to make these separate cases (even though the RF doesn’t use this info). This is confusing because the resulting RF model has the wrong distribution set when users are inspecting the params of the model, after-the-fact.
Current logic:
{code:java}if (_parms._distribution == DistributionFamily.AUTO) {
if (_nclass == 1) _parms._distribution = DistributionFamily.gaussian;
if (_nclass >= 2) _parms._distribution = DistributionFamily.multinomial;
}{code}
Even though RF doesn't have a "distribution" param, we are setting it automatically in AutoML. However, we are setting both binomial and multinomial as "multinomial". We should add an extra if statement to make these separate cases (even though the RF doesn’t use this info). This is confusing because the resulting RF model has the wrong distribution set when users are inspecting the params of the model, after-the-fact.
Current logic:
{code:java}if (_parms._distribution == DistributionFamily.AUTO) { if (_nclass == 1) _parms._distribution = DistributionFamily.gaussian; if (_nclass >= 2) _parms._distribution = DistributionFamily.multinomial; }{code}