databricks / spark-sql-perf

Apache License 2.0
586 stars 406 forks source link

[ML-2847] Add new tests for (DecisionTree, RandomForest)Regression, GMM, HashingTF #116

Closed smurching closed 7 years ago

smurching commented 7 years ago

This PR follows up on #112, adding new performance tests for DecisionTreeRegression, RandomForestRegression, GMM, and HashingTF.

Summary of changes:

I tested this PR by running the performance tests specified in mllib-small.yaml

smurching commented 7 years ago

This PR looks kinda big since I alphabetized the configs in mllib-small.yaml (to help keep track of which tests were being run). Let me know if it'd be helpful for me to de-alphabetize them/just commit the new test configs I've added.

jkbradley commented 7 years ago

LGTM I ran it locally Merging with master Thanks @smurching and @WeichenXu123 !