pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust
https://docs.pola.rs
Other
30.25k stars 1.95k forks source link

sink_parquet_all support #18364

Open EpicUsaMan opened 2 months ago

EpicUsaMan commented 2 months ago

Description

Currently there is no possibility to write data on disk without materializing it when dealing with many lazy frames which are nested from one lazy frame

The only possible way is to mimic it by using collect_all_async and hoping for the best

cmdlineluser commented 2 months ago

https://github.com/pola-rs/polars/issues/18235#issuecomment-2295187859

Our new streaming engine will be able to this. It will be able to multiplex and to multiple intermediate/output nodes.