OryxProject / oryx

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
http://oryx.io
Apache License 2.0
1.79k stars 405 forks source link

Add DataFrame, Dataset access #328

Closed srowen closed 5 years ago

srowen commented 7 years ago

The framework gives applications access to RDDs of data. Now that DataFrames and Datasets are stable and mature, the framework should expose some kind of DataFrame-based access to data, as well as Dataset if possible.