H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
I am trying to use H2OXGBoostClassifier with h2o-pysparkling-3.1 on a multi-node cluster. In the logs, I see messages like the following:
WARN hex.tree.xgboost.XGBoostExtension: Your system supports only minimal version of XGBoost (no GPUs, no multithreading)!
I have tried installing libgomp, gcc_linux-64, and llvm-openmp, but I am unable to get multicore utilization during model fitting. I'm not a Java expert, but it looks like there's no support for multithreading unless the underlying resource has GPU capabilities. Is this correct? Am I missing something that would enable multithreading? Thank you for taking a look!
I am trying to use H2OXGBoostClassifier with h2o-pysparkling-3.1 on a multi-node cluster. In the logs, I see messages like the following: WARN hex.tree.xgboost.XGBoostExtension: Your system supports only minimal version of XGBoost (no GPUs, no multithreading)! I have tried installing libgomp, gcc_linux-64, and llvm-openmp, but I am unable to get multicore utilization during model fitting. I'm not a Java expert, but it looks like there's no support for multithreading unless the underlying resource has GPU capabilities. Is this correct? Am I missing something that would enable multithreading? Thank you for taking a look!
JIRA link: https://h2oai.atlassian.net/browse/PUBDEV-9076