pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust
https://docs.pola.rs
Other
30.54k stars 1.98k forks source link

Enable "partition_by" for "sink_parquet" function #19845

Open herrmann1981 opened 1 week ago

herrmann1981 commented 1 week ago

Description

I can currently write parquet files in a hive partition scheme using "write_parquet(file, partition_by=['a', 'b'])". The partition_by parameter is currently not available for the "sink_parquet" function. This would be a huge benefit for use to not need to collect the whole LazyFrame before writing it.

aut0clave commented 1 week ago

Related to #11500 and #15441