apache / datafusion-ballista

Apache DataFusion Ballista Distributed Query Engine
https://datafusion.apache.org/ballista
Apache License 2.0

Avoid writing schema to shuffle files #944

Open Dandandan opened 6 months ago

Dandandan commented 6 months ago

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

In https://github.com/apache/arrow-ballista/pull/943 we switched from FileWriter to StreamWriter for shuffle files. However, the StreamWriter still writes the schema to disk in every shuffle file.

Describe the solution you'd like

Do not write the schema to disk; write only the data/buffers.
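To make the proposal concrete, here is a minimal standard-library sketch of the idea. It is not the Arrow IPC format and does not use the arrow crate: `ToySchema`, `write_shuffle`, `read_shuffle`, and the length-prefixed framing are all hypothetical stand-ins. It assumes the reader can obtain the schema out of band (for example from the query plan), so the per-file schema message becomes optional.

```rust
use std::io::{self, Read, Write};

/// Toy stand-in for a schema: just column names (hypothetical, not arrow's Schema).
#[derive(Debug, Clone, PartialEq)]
pub struct ToySchema {
    pub columns: Vec<String>,
}

impl ToySchema {
    /// Serialize as newline-joined column names.
    fn to_bytes(&self) -> Vec<u8> {
        self.columns.join("\n").into_bytes()
    }

    /// Parse the newline-joined form produced by `to_bytes`.
    fn from_bytes(bytes: &[u8]) -> io::Result<Self> {
        let text = String::from_utf8(bytes.to_vec())
            .map_err(|e| io::Error::new(io::ErrorKind::InvalidData, e))?;
        Ok(ToySchema {
            columns: text.split('\n').map(str::to_string).collect(),
        })
    }
}

/// Write one message: 4-byte little-endian length, then the payload.
/// Returns the number of bytes written.
fn write_message<W: Write>(w: &mut W, payload: &[u8]) -> io::Result<usize> {
    w.write_all(&(payload.len() as u32).to_le_bytes())?;
    w.write_all(payload)?;
    Ok(4 + payload.len())
}

/// Read one message, or `None` once the input is exhausted.
/// (In this sketch a truncated length prefix is also treated as end of input.)
fn read_message<R: Read>(r: &mut R) -> io::Result<Option<Vec<u8>>> {
    let mut len = [0u8; 4];
    match r.read_exact(&mut len) {
        Ok(()) => {}
        Err(e) if e.kind() == io::ErrorKind::UnexpectedEof => return Ok(None),
        Err(e) => return Err(e),
    }
    let mut payload = vec![0u8; u32::from_le_bytes(len) as usize];
    r.read_exact(&mut payload)?;
    Ok(Some(payload))
}

/// Write a toy shuffle file: optionally a schema message, then one message per batch.
pub fn write_shuffle<W: Write>(
    w: &mut W,
    schema: &ToySchema,
    batches: &[Vec<u8>],
    include_schema: bool,
) -> io::Result<usize> {
    let mut written = 0;
    if include_schema {
        written += write_message(w, &schema.to_bytes())?;
    }
    for batch in batches {
        written += write_message(w, batch)?;
    }
    Ok(written)
}

/// Read a toy shuffle file. If `known_schema` is `Some`, the file is assumed to
/// contain only batches; otherwise the first message is parsed as the schema.
pub fn read_shuffle<R: Read>(
    r: &mut R,
    known_schema: Option<&ToySchema>,
) -> io::Result<(ToySchema, Vec<Vec<u8>>)> {
    let schema = match known_schema {
        Some(s) => s.clone(),
        None => {
            let bytes = read_message(r)?.ok_or_else(|| {
                io::Error::new(io::ErrorKind::UnexpectedEof, "missing schema message")
            })?;
            ToySchema::from_bytes(&bytes)?
        }
    };
    let mut batches = Vec::new();
    while let Some(batch) = read_message(r)? {
        batches.push(batch);
    }
    Ok((schema, batches))
}
```

Calling `write_shuffle` with `include_schema` set to `false` and `read_shuffle` with `Some(&schema)` round-trips the same batches while saving the schema bytes in each file. The saving per file is small, but it is paid once per shuffle file, so it may add up when a stage produces many small files.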
