h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0

Minor defects and improvements found for Cox PH survival #8141

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

List of defects and one minor improvement to Cox PH found while preparing to demo functionality to a customer:

the Cox PH docs list y as the parameter defining the dependent variable; the correct name is event_column (see: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/coxph.html)

the parameter ignored_columns is not available in R, although it is available in Python and Flow

some missing parameters cause a NullPointerException (see PUBDEV-7492)

variables that are neither factor nor numeric cause "java.lang.ArrayIndexOutOfBoundsException: Index 0 out of bounds for length 0" (mentioned in PUBDEV-7492)

typo in the CoxPH R example: variable heart should be variable heart.hex (see: http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/algo-params/stop_column.html):

```r
# import the heart dataset
heart <- h2o.importFile("http://s3.amazonaws.com/h2o-public-test-data/smalldata/coxph_test/heart.csv")
...

# train your model
coxph.h2o <- h2o.coxph(x=x, event_column=y, start_column=start, stop_column=stop, training_frame=heart.hex)
```

add the term “Survival Analysis” to the Cox PH docs (it currently occurs only once, in the references).

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-7497
Assignee: Ondrej Nekola
Reporter: Gregory Kanevsky
State: Open
Fix Version: N/A
Attachments: N/A
Development PRs: N/A