lensacom / sparkit-learn

PySpark + Scikit-learn = Sparkit-learn
Apache License 2.0
1.15k stars 255 forks source link

[RFC] Scikit interface for the `ml` and `mllib` packages #54

Open kszucs opened 9 years ago

kszucs commented 9 years ago

PySpark machine learning packages are getting more robust. It is considerable to make use of the already implemented distributed algorithms through an sklearn compatible interface instead of porting the non-distributed ones.