pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust
https://docs.pola.rs

Add modular parquet de-/encryption #19858

Open brainslush opened 3 days ago

brainslush commented 3 days ago

Description

There are ongoing efforts to implement Parquet de-/encryption in arrow-rs: https://github.com/apache/arrow-rs/pull/6637

I suggest extending the existing scan, read, sink, and write interfaces to handle encrypted Parquet.

I offer to implement it.
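
To make the suggestion a bit more concrete, here is a rough sketch of the kind of options object the scan, read, sink, and write interfaces might accept. Everything in it is an assumption for illustration: `ParquetEncryptionConfig`, `footer_key`, `column_keys`, and `key_for` are hypothetical names and not part of Polars or arrow-rs. It assumes the Parquet modular encryption model of one footer key plus optional per-column keys, with AES keys of 16, 24, or 32 bytes, and it assumes that when per-column keys are given, unlisted columns stay unencrypted.

```python
from dataclasses import dataclass, field
from typing import Optional

# AES-128 / AES-192 / AES-256 key sizes in bytes.
_VALID_KEY_LENGTHS = (16, 24, 32)


@dataclass(frozen=True)
class ParquetEncryptionConfig:
    """Hypothetical options object; NOT an existing Polars or arrow-rs type."""

    footer_key: bytes
    # Optional per-column keys, keyed by column name (assumed design).
    column_keys: dict[str, bytes] = field(default_factory=dict)

    def __post_init__(self) -> None:
        # Validate every key up front so errors surface before any I/O.
        named_keys = [("<footer>", self.footer_key), *self.column_keys.items()]
        for name, key in named_keys:
            if len(key) not in _VALID_KEY_LENGTHS:
                raise ValueError(
                    f"key for {name!r} has {len(key)} bytes; "
                    f"expected one of {_VALID_KEY_LENGTHS}"
                )

    def key_for(self, column: str) -> Optional[bytes]:
        """Return the key for `column`, or None if it stays unencrypted.

        Assumption: with no per-column keys, every column uses the footer
        key (uniform encryption); otherwise only listed columns are encrypted.
        """
        if not self.column_keys:
            return self.footer_key
        return self.column_keys.get(column)


if __name__ == "__main__":
    uniform = ParquetEncryptionConfig(footer_key=b"0" * 16)
    per_column = ParquetEncryptionConfig(
        footer_key=b"0" * 16, column_keys={"ssn": b"1" * 32}
    )
    print(uniform.key_for("name") is not None)
    print(per_column.key_for("name") is None)
```

How such an object would be threaded through the existing interfaces, and whether keys should instead be resolved through a key-retrieval callback, is left open here and would depend on what the arrow-rs work ends up exposing.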

coastalwhite commented 3 days ago

I quite agree with most of what is being said in the arrow-rs thread:

If you want to take this up, I suggest you first make a PR with a rough draft.

rok commented 1 day ago

Glad to see this! https://github.com/apache/arrow-rs/pull/6637 will most likely be ring-based. I'd be happy to help with reviews, etc.