Open mariusz-jachimowicz-83 opened 7 years ago
This project is worth a look: https://github.com/uwescience/raco. It provides a compiler and optimizer for Datalog and a subset of SQL, targeting multiple backends including Spark DataFrames and Flink. Something like that would subsume #708.
Another, somewhat larger effort is a SQL DSL on top of Onyx, again along the lines of HoneySQL. We could write a library that reads a data structure representing a SQL query and outputs an Onyx job. This is a well-studied topic, so it could be implemented directly from an academic paper.
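To make the "data structure in, job out" idea concrete, here is a rough sketch in Python rather than Clojure, purely for illustration. It takes a HoneySQL-style dict and emits a job-like dict with a `workflow` (edges between tasks) and a `catalog` (one entry per task). Everything here is an assumption: `compile_query` is a hypothetical name, the `onyx/...` keys are only loosely modeled on Onyx catalog entries, and `params` is made up for this example. A real job would also need plugin configuration, lifecycles, and actual task functions, and a real compiler would go through a relational-algebra plan and an optimizer instead of a fixed pipeline.

```python
def compile_query(query, batch_size=20):
    """Translate a HoneySQL-style dict into a job-like dict (hypothetical shape)."""
    source = query["from"]

    # Build a linear plan: read -> optional filter -> optional projection -> write.
    steps = [("read-" + source, "input", None)]
    if "where" in query:
        steps.append(("filter", "function", {"predicate": query["where"]}))
    if "select" in query:
        steps.append(("project", "function", {"columns": list(query["select"])}))
    steps.append(("write-output", "output", None))

    # One catalog entry per task; key names are illustrative, not a verified schema.
    catalog = []
    for name, kind, params in steps:
        entry = {"onyx/name": name, "onyx/type": kind, "onyx/batch-size": batch_size}
        if params is not None:
            entry["params"] = params  # hypothetical key, invented for this sketch
        catalog.append(entry)

    # The workflow is the list of edges between consecutive tasks.
    names = [name for name, _, _ in steps]
    workflow = [[a, b] for a, b in zip(names, names[1:])]
    return {"workflow": workflow, "catalog": catalog}


job = compile_query(
    {"select": ["name", "age"], "from": "users", "where": [">", "age", 30]}
)
```

For this query, `job["workflow"]` chains `read-users`, `filter`, `project`, and `write-output` in that order. Joins and aggregations are where the academic literature would really be needed, since they do not map onto a simple linear chain of tasks.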
(source: Slack conversation)