h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

PSVM giving NaNs for some metrics #8323

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

This issue was found in R with binary classification, but it should also be checked in Python:

{code:r}
library(h2o)
h2o.init()

# Import the Higgs train/test samples
train <- h2o.importFile("https://s3.amazonaws.com/erin-data/higgs/higgs_train_10k.csv")
test <- h2o.importFile("https://s3.amazonaws.com/erin-data/higgs/higgs_test_5k.csv")

y <- "response"
x <- setdiff(names(train), y)

# Binary classification: convert the response to a factor
train[,y] <- as.factor(train[,y])
test[,y] <- as.factor(test[,y])

# Train PSVM and evaluate on the test set
psvm <- h2o.psvm(x = x, y = y, training_frame = train)
h2o.performance(psvm, test)
{code}

Gives NaNs:

H2OBinomialMetrics: psvm

MSE: 0.43
RMSE: 0.6557439
LogLoss: NaN
Mean Per-Class Error: 0.4269508
AUC: NaN
AUCPR: NaN
Gini: NaN
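
For the Python check, a minimal sketch of the equivalent reproduction might look like the following. It has not been run as part of this report. It assumes the `H2OSupportVectorMachineEstimator` class and the usual accessors on the performance object (`mse()`, `logloss()`, `auc()`, and so on) from the h2o Python package. The helper `nan_metric_names` and the function `psvm_test_metrics` are hypothetical names added here for illustration. Whether the Python client returns NaN as a float or as the string "NaN" is also an assumption, so the helper accepts both.

```python
import math


def nan_metric_names(metrics):
    """Return the names of metrics whose value is NaN, in insertion order.

    Accepts float NaN as well as the string "NaN", since the exact type
    returned by the client is an assumption here.
    """
    def is_nan(value):
        try:
            return math.isnan(float(value))
        except (TypeError, ValueError):
            return False

    return [name for name, value in metrics.items() if is_nan(value)]


def psvm_test_metrics():
    """Train PSVM on the Higgs sample from the report and score the test set."""
    # Imported lazily so the helper above can be used without h2o installed.
    import h2o
    from h2o.estimators import H2OSupportVectorMachineEstimator

    h2o.init()
    train = h2o.import_file("https://s3.amazonaws.com/erin-data/higgs/higgs_train_10k.csv")
    test = h2o.import_file("https://s3.amazonaws.com/erin-data/higgs/higgs_test_5k.csv")

    y = "response"
    x = [col for col in train.columns if col != y]

    # Binary classification: convert the response to a categorical column.
    train[y] = train[y].asfactor()
    test[y] = test[y].asfactor()

    psvm = H2OSupportVectorMachineEstimator()
    psvm.train(x=x, y=y, training_frame=train)
    perf = psvm.model_performance(test)

    # Collect the same metrics that the R output above prints.
    return {
        "MSE": perf.mse(),
        "RMSE": perf.rmse(),
        "LogLoss": perf.logloss(),
        "AUC": perf.auc(),
        "AUCPR": perf.aucpr(),
        "Gini": perf.gini(),
    }
```

Calling `nan_metric_names(psvm_test_metrics())` against a running cluster would list the metrics that come back as NaN, which can then be compared with the R output above.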

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-7314
Assignee: Michal Kurka
Reporter: Erin LeDell
State: Open
Fix Version: Backlog
Attachments: N/A
Development PRs: Available

Linked PRs from JIRA

https://github.com/h2oai/h2o-3/pull/4341