apache / datafusion-python

Apache DataFusion Python Bindings
https://datafusion.apache.org/python
Apache License 2.0
320 stars 63 forks source link

EPIC: Add all `SessionContext` and `DataFrame` methods to Python API #24

Open andygrove opened 1 year ago

andygrove commented 1 year ago

The Python bindings currently only expose a subset of functionality, and we want to expose as much as possible.

Here is a list of all available rust methods. Note that there may be reasons why we don't want to expose some of these.

SessionContext

DataFrame

francis-du commented 1 year ago

@andygrove Hi Andy. Could you update this task list? If you have a time.

nkarpov commented 1 year ago

Noticing that join_on (supporting arbitrary expressions instead of keys) is not present in the list above.

I can't find a way to use the existing binding to join on an arbitrary expression

Should the Rust join_on have its own wrapper too? or should we have the existing join wrapper choose appropriately?

ami-navon commented 1 year ago

+1 register_json( )