NVIDIA / spark-rapids-examples

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
Apache License 2.0
130 stars 51 forks source link

Update Spark-RAPIDS-ML PCA #440

Closed rishic3 closed 1 month ago

rishic3 commented 2 months ago

Update the outdated Scala PCA example to use Python-based Spark-RAPIDS-ML. Minor changes to Scala dataset for speedup demonstration (using 100k rows vs. 50k rows, using float32 vs. float64).

eordentlich commented 1 month ago

Ideally, this PR should also delete the legacy pca related code.