Closed djouallah closed 5 days ago
Nice. Looks like this will be added in 4.0.0. I will start planning for how SQLFrame will support this.
gentle ping as i see you fixed already schema
Yeah I was planning on starting to support the 4.0.0 API once it is actually released. Can you provide an example of how you want to use toArrow
with SQLFrame? I'm assuming it is for DuckDB right?
Copy data to Delta table using the python writer which accepted arrow table as input
Would you be doing this using DuckDB? I'm not sure what you mean by python writer.
Yes DuckDB
Thanks for the details @djouallah. I do intend to do this, but just not sure on the timeline right now. I'm still trying to cover more gaps in the current DataFrame API coverage first.
here is my pitch, let's early adopter have this pattern working, raw data --- sqlframe --- arrow--- delta/iceberg and you will get the remaining gaps in the fullness of time :)
Yeah I see your point. I'm good with allocating some time to this and see what the complexity looks like. I see the value in implementing this pattern. Going to fix some remaining issues and then start exploring this.
finally it was added to Spark
https://github.com/apache/spark/pull/45481