Using {{h2o.group_by()}} with {{col.names}} inside {{gb.control}} (R client) has no effect on the resulting H2OFrame's column names.

{code:r}
library(h2o)
h2o.init()
cars.hex = as.h2o(mtcars)
# group by with default column names
h2o.group_by(cars.hex, by="cyl", sum(wt), sum(hp))
# group by with column names using gb.control
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=c("wt","hp")))
# no effect on column names
# no effect again when using 3 names:
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=c("cyl","wt","hp")))
# and no effect if using list() instead of c()
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=list("cyl","wt","hp")))
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=list("wt","hp")))
{code}
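
A possible workaround until {{col.names}} is honored is to rename the columns after aggregating. This is only a sketch: it assumes the result keeps the grouping column first, followed by the aggregates in the order they were passed, and {{res}} is a name chosen here for illustration.

{code:r}
library(h2o)
h2o.init()
cars.hex <- as.h2o(mtcars)

# aggregate with the default column names
res <- h2o.group_by(cars.hex, by = "cyl", sum("wt"), sum("hp"))

# rename afterwards; assumes the column order is: group column, sum of wt, sum of hp
names(res) <- c("cyl", "wt", "hp")
print(names(res))
{code}

Check the default names before overwriting them, since a different column order would silently mislabel the aggregates.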