h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.92k stars 2k forks source link

SVD using netlib-java for PCA #11264

Closed exalate-issue-sync[bot] closed 1 year ago

exalate-issue-sync[bot] commented 1 year ago

H2O provides implementation of the PCA algorithm which depends on the Jama library. The library is used for several tasks including Singular Value Decomposition (SVD). However, the library also introduces sub-optimal performance.

The idea is to replace Jama SVD computation by the netlib-java library [MTJ|https://github.com/fommil/matrix-toolkits-java], and measure performance impact.

This task is based on assignment [Improve H2O PCA|https://gist.github.com/mmalohlava/717ad7b7441a6ff91b5f0a907482bd5d]

h2o-ops commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-4375 Assignee: Terry Tang Reporter: Karel Ha State: Resolved Fix Version: 3.20.0.1 Attachments: N/A Development PRs: Available

Linked PRs from JIRA

https://github.com/h2oai/h2o-3/pull/1827 https://github.com/h2oai/h2o-3/pull/1025