lensacom / sparkit-learn

PySpark + Scikit-learn = Sparkit-learn
Apache License 2.0
1.15k stars 255 forks

Cassandra support? #1

Closed: ghost closed 9 years ago

ghost commented 9 years ago

Looks great. Can this support pulling RDDs from Cassandra with CQL?

kszucs commented 9 years ago

Hey Ged,

Thanks! This project aims to support scikit-learn on Spark; at the moment, Cassandra support is out of our scope. To load Cassandra data into an RDD, we suggest trying one of the following libraries:

After importing you can pass the RDD to sparkit-learn.
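A minimal sketch of that hand-off, assuming the connector gives you an RDD whose rows behave like dicts. The helper names `row_to_features` and `to_array_rdd`, the column names, and the block size are all made up for illustration; `ArrayRDD` and its `bsize` argument are used as shown in the sparkit-learn README, but check them against the version you have installed:

```python
def row_to_features(row, columns):
    """Turn one row (assumed to be dict-like) into a list of floats."""
    return [float(row[c]) for c in columns]


def to_array_rdd(rows_rdd, columns, bsize=1000):
    """Wrap an RDD of rows in sparkit-learn's ArrayRDD.

    rows_rdd is assumed to come from whichever Cassandra connector you use.
    """
    # Imported lazily so row_to_features can be used without Spark installed.
    from splearn.rdd import ArrayRDD

    features = rows_rdd.map(lambda row: row_to_features(row, columns))
    # ArrayRDD groups the elements into blocks of up to bsize rows.
    return ArrayRDD(features, bsize=bsize)
```

With a hypothetical `rows_rdd` in hand, `X = to_array_rdd(rows_rdd, ["age", "income"])` would give you something you can pass to the sparkit-learn estimators.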