datafusion-contrib / datafusion-java

Java binding to Apache Arrow DataFusion
Apache License 2.0
66 stars 10 forks source link

Expose DataFusion Dataframe API #144

Open cbiggar opened 4 months ago

cbiggar commented 4 months ago

It appears as though the Java bindings only expose a SQL string as a query interface.

This makes programmatic querying of the DataFusion engine hard as it would rely on translating our internal dataframe-like AST into a SQL string first. This feels brittle as it introduces a whole class of bugs related to SQL syntax errors which aren't a concern when translating from an internal dataframe representation to Datafusion's dataframe representation.

I can see that the Rust Native DataFusion API contains a Dataframe API.

Would you consider exposing this in the java bindings too?