H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
The various gradle build jobs (gradle_build, gradle_build_commit_only, gradle_build_J8) are INTERMITTENT with common error:
INFO: ###########################################################
INFO: Test class name: water.parser.ParserTest
INFO: Test method name: testStrings
INFO: ###########################################################
INFO: Total file size: 3.6 MB
INFO: Parse chunk size 4194304
2015-09-24 12:13:55.825:INFO:oejsh.ContextHandler:stopped o.e.j.s.ServletContextHandler{/,null}
ERRR: Out of Memory, Heap Space exceeded, increase Heap Size, from /172.16.2.172:43018
FATAL: Exiting.
ERRR: Error leading to a cloud kill from /172.16.2.172:43018
FATAL: Exiting.
Of note, the number of cpus on the machines where these jobs are run vary from 4 to 8 to 24.
I'm going to bump up Xmx on h2o-core/testMultiNode.sh as an interim fix, but this issue needs to be investigated more in the future
The various gradle build jobs (gradle_build, gradle_build_commit_only, gradle_build_J8) are INTERMITTENT with common error:
INFO: ########################################################### INFO: Test class name: water.parser.ParserTest INFO: Test method name: testStrings INFO: ########################################################### INFO: Total file size: 3.6 MB INFO: Parse chunk size 4194304 2015-09-24 12:13:55.825:INFO:oejsh.ContextHandler:stopped o.e.j.s.ServletContextHandler{/,null} ERRR: Out of Memory, Heap Space exceeded, increase Heap Size, from /172.16.2.172:43018 FATAL: Exiting. ERRR: Error leading to a cloud kill from /172.16.2.172:43018 FATAL: Exiting.
Of note, the number of cpus on the machines where these jobs are run vary from 4 to 8 to 24.
I'm going to bump up Xmx on h2o-core/testMultiNode.sh as an interim fix, but this issue needs to be investigated more in the future