onyx-platform / onyx

Distributed, masterless, high performance, fault tolerant data processing
http://www.onyxplatform.org
Eclipse Public License 1.0
2.05k stars 205 forks source link

SQL DSL #707

Open mariusz-jachimowicz-83 opened 7 years ago

mariusz-jachimowicz-83 commented 7 years ago

Another, somewhat larger effort, is a SQL DSL ontop of Onyx. Again, something like HoneySQL. We could write a library that reads a data structure representing SQL and output an Onyx job. This is a well studied topic that could be implemented directly from an academic paper.

(source slack conversation )

gardnervickers commented 7 years ago

This project is worth a look. It provides a compiler/optimizer for Datalog and some SQL subset to multiple backends, including Spark Dataframes and Flink. https://github.com/uwescience/raco Something like that would subsume #708.