Closed rishic3 closed 1 month ago
Update the outdated Scala PCA example to use the Python-based Spark-RAPIDS-ML. Minor changes to the Scala example's dataset for the speedup demonstration (100k rows vs. 50k rows, float32 vs. float64).
Ideally, this PR should also delete the legacy PCA-related code.
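For reviewers unfamiliar with the Python API, the following is a minimal sketch of what a PCA run on a 100k-row float32 dataset might look like. It is not the code in this PR. It assumes that `spark_rapids_ml.feature.PCA` follows the `pyspark.ml`-style estimator interface (`k`, `inputCol`, `setOutputCol`, `fit`, `transform`). The column names, the feature width `n_cols`, the value of `k`, and both helper functions are hypothetical choices made for illustration.

```python
import random
from array import array


def make_rows(n_rows=100_000, n_cols=8, seed=0):
    """Generate rows of single-precision features.

    n_cols and seed are illustrative assumptions, not values from this PR.
    """
    rng = random.Random(seed)
    # array("f") stores C floats, so each value is rounded to float32.
    return [
        (array("f", (rng.gauss(0.0, 1.0) for _ in range(n_cols))).tolist(),)
        for _ in range(n_rows)
    ]


def fit_gpu_pca(spark, rows, k=3):
    """Fit PCA on an existing SparkSession (hypothetical helper)."""
    # Imported lazily so make_rows can be used without Spark or a GPU.
    from spark_rapids_ml.feature import PCA

    # Assumed schema: one array<float> column holding the float32 features.
    df = spark.createDataFrame(rows, schema="features array<float>")
    pca = PCA(k=k, inputCol="features")
    pca.setOutputCol("pca_features")
    model = pca.fit(df)
    return model.transform(df)
```

With a GPU-enabled session, `fit_gpu_pca(spark, make_rows())` would return the DataFrame with the projected `pca_features` column added. Building the rows on the driver keeps the sketch short, but a real example would more likely generate the data inside Spark.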