h2oai / sparkling-water

Sparkling Water provides H2O functionality inside a Spark cluster
https://docs.h2o.ai/sparkling-water/3.3/latest-stable/doc/index.html
Apache License 2.0

Reconsider the role and naming conventions of the 'columnsToCategorical' and 'allStringColumnsToCategorical' properties #4525

Closed: exalate-issue-sync[bot] closed this issue 1 year ago

exalate-issue-sync[bot] commented 1 year ago

In Apache Spark, such properties are usually expressed as separate transformers/estimators, so we should reconsider the current implementation of these algorithm properties.

Optionally, also consider shorter names for these properties.
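To make the proposal concrete, here is a minimal, dependency-free Python sketch of the idea: the string-to-categorical conversion lives in its own fit/transform stage instead of being a property of the algorithm. The names `StringToCategorical` and `all_string_columns` are hypothetical and exist only for illustration; they are not part of the Sparkling Water or Spark API, and the integer encoding used here is an assumption.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence

Row = Dict[str, object]


@dataclass
class StringToCategorical:
    """Hypothetical standalone stage (not a real Spark or Sparkling Water class).

    Plays the role of 'columnsToCategorical', but as a separate transformer.
    """
    columns: Sequence[str]
    levels: Dict[str, List[str]] = field(default_factory=dict)

    def fit(self, rows: List[Row]) -> "StringToCategorical":
        # Learn the sorted distinct values (levels) of each configured column.
        for col in self.columns:
            self.levels[col] = sorted({str(r[col]) for r in rows})
        return self

    def transform(self, rows: List[Row]) -> List[Row]:
        # Replace each value with the index of its level; copy rows so the
        # input is left untouched and other columns pass through unchanged.
        out = []
        for r in rows:
            new_row = dict(r)
            for col in self.columns:
                new_row[col] = self.levels[col].index(str(r[col]))
            out.append(new_row)
        return out


def all_string_columns(rows: List[Row]) -> List[str]:
    """Hypothetical helper standing in for 'allStringColumnsToCategorical'."""
    if not rows:
        return []
    return [c for c in rows[0] if all(isinstance(r[c], str) for r in rows)]


rows = [
    {"color": "red", "size": "L", "price": 10.0},
    {"color": "blue", "size": "M", "price": 12.5},
    {"color": "red", "size": "M", "price": 9.0},
]

# Explicit column list (the 'columnsToCategorical' case).
explicit = StringToCategorical(columns=["color"]).fit(rows).transform(rows)

# Every string column (the 'allStringColumnsToCategorical' case).
auto = StringToCategorical(columns=all_string_columns(rows)).fit(rows).transform(rows)
```

With this split, the downstream algorithm would need no categorical-conversion options of its own: it simply consumes whatever the preceding stage produced.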

exalate-issue-sync[bot] commented 1 year ago

Jakub Hava commented: The ultimate goal should be to converge on the Spark API with setFeatureCol(), where these options are no longer necessary.

DinukaH2O commented 1 year ago

JIRA Issue Migration Info

Jira Issue: SW-1231
Assignee: UNASSIGNED
Reporter: Marek Novotny
State: In Progress
Fix Version: N/A
Attachments: N/A
Development PRs: Available

Linked PRs from JIRA

https://github.com/h2oai/sparkling-water/pull/1175

hasithjp commented 1 year ago

JIRA Issue Migration Info Cont'd

Jira Issue Created Date: 2019-04-23T04:52:09.936-0700