datafusion-contrib / datafusion-objectstore-s3

S3 as an ObjectStore for DataFusion
Apache License 2.0

Create python bindings #34

Open matthewmturner opened 2 years ago

jimexist commented 2 years ago

I guess this could be folded into the datafusion-python repo.

matthewmturner commented 2 years ago

@Jimexist that would certainly decrease time to market. In the short run I think it's a great idea.

Just to confirm: are you suggesting we make it part of the same datafusion Python package, or a separate package like datafusion-s3?

@seddonm1 any thoughts?

seddonm1 commented 2 years ago

Sorry I have zero knowledge of the Python bindings.

The only benefit of separating it from the datafusion-python repo may be if users don't want to install certain components (for example a GCP user may not need S3)?

jimexist commented 2 years ago

Something we can consider is making sure this repo's Python API is compatible with the one from datafusion-python, and then allowing it to be installed via extras:

In this way, users can do:

pip install datafusion datafusion[s3] datafusion[gcp]

matthewmturner commented 2 years ago

@Jimexist I think that's a good idea. I will look into it.
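
For illustration only, here is a minimal sketch of how a Python package might surface the extras idea discussed above at import time. It assumes that each extra (such as `s3` or `gcp`) would install a separate module; the module names `datafusion_objectstore_s3` and `datafusion_objectstore_gcp` and the helper `load_object_store` are hypothetical and are not part of datafusion-python or this repo.

```python
import importlib

# Hypothetical mapping from an extra's name to the module it would provide.
# These module names are assumptions made for illustration only.
EXTRA_MODULES = {
    "s3": "datafusion_objectstore_s3",
    "gcp": "datafusion_objectstore_gcp",
}


def load_object_store(extra, modules=EXTRA_MODULES):
    """Import the module backing an optional extra, or explain how to install it."""
    if extra not in modules:
        raise ValueError(f"unknown object store extra: {extra!r}")
    try:
        return importlib.import_module(modules[extra])
    except ImportError as err:
        # Point the user at the extra that would pull in the missing module.
        raise ImportError(
            f"object store {extra!r} is not installed; "
            f"try: pip install 'datafusion[{extra}]'"
        ) from err
```

With a layout like this, a GCP user who never installs the `s3` extra would only see an error if they actually asked for the S3 store, which matches the motivation raised earlier in the thread for keeping the components separately installable.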

chitralverma commented 2 years ago

+1 for this!

schapala-hm commented 2 years ago

+1 for this