h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.92k stars 2k forks source link

Deeplearning missing_value_handling does not implement meanImputation correctly. #11739

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

In working on deeplearning mojo, found that missing_values for categorical NAN is always set to the extra categorical level set aside during training. For numerical NAN, it is always filled 0 which is correct when standardize is set to True. Need to make sure this imputation method is consistent with other algos of H2O.

hasithjp commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-4862 Assignee: New H2O Bugs Reporter: Wendy State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A