h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.94k stars 2k forks source link

Gradle build jenkins jobs INTERMITTENT fail on OOM #15060

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago

The various gradle build jobs (gradle_build, gradle_build_commit_only, gradle_build_J8) are INTERMITTENT with common error:

INFO: ########################################################### INFO: Test class name: water.parser.ParserTest INFO: Test method name: testStrings INFO: ########################################################### INFO: Total file size: 3.6 MB INFO: Parse chunk size 4194304 2015-09-24 12:13:55.825:INFO:oejsh.ContextHandler:stopped o.e.j.s.ServletContextHandler{/,null} ERRR: Out of Memory, Heap Space exceeded, increase Heap Size, from /172.16.2.172:43018 FATAL: Exiting. ERRR: Error leading to a cloud kill from /172.16.2.172:43018 FATAL: Exiting.

Of note, the number of cpus on the machines where these jobs are run vary from 4 to 8 to 24.

I'm going to bump up Xmx on h2o-core/testMultiNode.sh as an interim fix, but this issue needs to be investigated more in the future

DinukaH2O commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-2142 Assignee: Brandon Hill Reporter: Eric Eckstrand State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A