Closed WeichenXu123 closed 7 years ago
cc @smurching @jkbradley
@smurching Most places update according to your feedback. I add a DocGenerator
for Tokenizer
.
Leave a issue solving MLParam
turning into DataFrame column broken, I will fix it tomorrow!
Thanks for kindly review!
FPGrowth
PR #113 pls also help review, thanks!
Update code against new MLParam
impl. thanks! cc @smurching @jkbradley
One more comment, copying Sid's comment from above: Could you also add params for your new tests to src/main/scala/configs/mllib-small.yaml?
LGTM except the 1 comment above. Since this is blocking some other tasks, I'll merge it. Can you please send a tiny PR after this to fix the remaining comment? Thanks!
Add benchmark for:
FPGrowth is working on will be added soon.
Part of code from https://github.com/smurching/spark-sql-perf/pull/1 and address feedbacks from @smurching