apache / datafusion-python

Apache DataFusion Python Bindings
https://datafusion.apache.org/python
Apache License 2.0
321 stars 63 forks source link

Expose `register_listing_table` #617

Closed henrifroese closed 2 months ago

henrifroese commented 3 months ago

I want to query hive-partitioned parquet files in S3 using the python DataFusion client. Currently, pyarrow datasets are already supported, but I've found performance to be lacking. Instead, I'd rather use object_store. The python bindings already support creating object stores, so we only need to expose register_listing_table for this.