h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

Using col.names with group by (in R h2o.group_by()) has no effect on resulting H2OFrame's column names #9068

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

Using h2o.group_by() with col.names inside gb.control (R client) has no effect on resulting H2OFrame's column names.

{code:r}
library(h2o)
h2o.init()
cars.hex = as.h2o(mtcars)

# group by with default column names
h2o.group_by(cars.hex, by="cyl", sum(wt), sum(hp))

# group by with column names using gb.control
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=c("wt","hp")))
# no effect on column names

# no effect again when using 3 names:
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=c("cyl","wt","hp")))

# and no effect if using list instead of c
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=list("cyl","wt","hp")))
h2o.group_by(cars.hex, by="cyl", sum("wt"), sum("hp"), gb.control = list(col.names=list("wt","hp")))
{code}
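Until col.names is honored, one possible workaround is to rename the columns of the aggregated frame after the fact. The following is only a sketch, not a fix for the underlying bug. It assumes a running local H2O cluster, that the grouping column comes first in the result, and that assigning to names() on an H2OFrame behaves as it does for other frames. The names wt_total and hp_total are hypothetical and chosen only for illustration.

```r
library(h2o)
h2o.init()

cars.hex <- as.h2o(mtcars)

# Aggregate first, accepting whatever default column names come back.
grouped <- h2o.group_by(cars.hex, by = "cyl", sum("wt"), sum("hp"))

# Hypothetical names for illustration: one per output column,
# assuming the grouping column ("cyl") is returned first.
new_names <- c("cyl", "wt_total", "hp_total")

# Guard against a mismatch between the names and the result's columns.
stopifnot(length(new_names) == ncol(grouped))

# Rename the result directly instead of relying on gb.control.
names(grouped) <- new_names
print(grouped)
```

The length check is there because a mismatched name vector would otherwise fail later or mislabel columns; it may be worth printing the frame before renaming to confirm the column order.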

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-6563
Assignee: New H2O Bugs
Reporter: Gregory Kanevsky
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A