H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
parameter _ignoredcolumns not available with R while available with Python and Flow
some missing parameters cause NPE (see PUBDEV-7492)
variables of being not factors/numeric cause ??java.lang.ArrayIndexOutOfBoundsException: Index 0 out of bounds for length 0?? (mentioned in PUBDEV-7492)
List of defects and one minor improvement to Cox PH found while preparing to demo functionality to a customer:
Cox PH docs list y as parameter defining dependent variable - correct name is _eventcolumn (see: [http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/coxph.html )|http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/coxph.html]
parameter _ignoredcolumns not available with R while available with Python and Flow
some missing parameters cause NPE (see PUBDEV-7492)
variables of being not factors/numeric cause ??java.lang.ArrayIndexOutOfBoundsException: Index 0 out of bounds for length 0?? (mentioned in PUBDEV-7492)
mistype in CoxPH R example: variable heart should be variable heart.hex (see: [http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/algo-params/stop_column.html):|http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/algo-params/stop_column.html]
{code:r}# import the heart dataset heart <- h2o.importFile("http://s3.amazonaws.com/h2o-public-test-data/smalldata/coxph_test/heart.csv") ...
train your model
coxph.h2o <- h2o.coxph(x=x, event_column=y, start_column=start, stop_column=stop, training_frame=heart.hex){code}