h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.9k stars 2k forks source link

frame.pivot() relax index column data type check #12124

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

https://github.com/h2oai/h2o-3/blob/c0f9ffef3b68e4727b1efe36c1c1111850519ee9/h2o-py/h2o/frame.py#L2542

requires both index and columns to be pivoted to be of following data types:

"enum","time","int"

What was motivation for this limitation?

It would be great to have at least index column not to have such a limitation.

When we convert Spark dataframes in pySparkling Water to h2o frames, we often have some columns with type string.

hasithjp commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-5252 Assignee: New H2O Bugs Reporter: Ruslan Dautkhanov State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A