Closed exalate-issue-sync[bot] closed 1 year ago
JIRA Issue Details
Jira Issue: PUBDEV-8004 Assignee: Tomas Fryda Reporter: Tomas Fryda State: Closed Fix Version: 3.32.1.6 Attachments: N/A Development PRs: Available
Linked PRs from JIRA
https://github.com/h2oai/h2o-3/pull/5344 https://github.com/h2oai/h2o-3/pull/5631
AutoML can have hard time with datasets with high cardinality columns, e.g., Albert[1]. One of the reasons is DeepLearning that one-hot encodes the dataset yielding over 1M columns.
[1] https://www.openml.org/d/41147