trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache License 2.0

Trino Parquet Column Encryption #9383

Open shangxinli opened 3 years ago

shangxinli commented 3 years ago

Parquet modular encryption provides a column encryption feature. It would be great to have that in Presto. Because progress in PrestoDB has been slow, I would like to start the feature in TrinoDB. The original issue in PrestoDB can be found here.

It is a large feature and can be divided into three parts: 1) Common code path 2) Decryption 3) Encryption. A draft of the common code path and decryption changes can be found here.

xiacongling commented 2 years ago

Hi, @shangxinli. I noticed that the previous PR has been closed. Is there any progress on Parquet column encryption?

amoghmargoor commented 1 year ago

We at Apple have been running PME for a while now and would like to contribute decryption support for it to Trino. We will be raising a draft PR for this soon. cc @vgankidi

amoghmargoor commented 1 year ago

Design doc for the decryption support: https://docs.google.com/document/d/1hve3UvT2oa6ud1iWifI6KH2dJpSjpg62svy6ES-cjcM/edit?usp=sharing

my7ym commented 11 months ago

@amoghmargoor Do you folks have any timeline to deliver the PME support for Trino? Thanks!

hashhar commented 2 months ago

It seems this is no longer being actively worked on.

Feel free to re-open if needed.

sopel39 commented 2 months ago

I think we should reopen it. cc @amoghmargoor

amoghmargoor commented 1 month ago

I have just opened a new, rebased PR for this: https://github.com/trinodb/trino/pull/23583. The PR is WIP, as we are still adding test cases for the encryption. It will have many more new files than the existing PRs because the parquet-hadoop module has been decoupled.