StarRocks / starrocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
https://starrocks.io
Apache License 2.0

Parquet Dictionary Encoding Configuration #42467

Open hanyunYep opened 6 months ago

hanyunYep commented 6 months ago

Enhancement

Currently, when StarRocks writes a Parquet file, the columns are always dictionary-encoded with RLE_DICTIONARY, even when the Hive table property parquet.enable.dictionary=false is set or the BE parameter column_dictionary_key_ratio_threshold=0 is set. It would be useful to be able to configure whether dictionary encoding is used at all, or to select another encoding such as PLAIN_DICTIONARY or BIT_PACKED.

StarRocks Version 3.2.2
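
For anyone who wants to check this on their own files, the encodings that a writer recorded can be read from the Parquet footer. The following is a minimal sketch, assuming the third-party pyarrow package is installed; the helper names `uses_dictionary` and `column_encodings` are made up for illustration and are not part of StarRocks or pyarrow.

```python
from typing import Dict, Iterable, Tuple


def uses_dictionary(encodings: Iterable[str]) -> bool:
    """Return True if any encoding name is a dictionary encoding.

    Matches both RLE_DICTIONARY and the older PLAIN_DICTIONARY name.
    """
    return any("DICTIONARY" in name.upper() for name in encodings)


def column_encodings(path: str) -> Dict[str, Tuple[str, ...]]:
    """Map each column path to the encodings reported across all row groups."""
    # pyarrow is a third-party dependency (assumed installed); it is imported
    # lazily so that the pure helper above works without it.
    import pyarrow.parquet as pq

    metadata = pq.ParquetFile(path).metadata
    seen: Dict[str, set] = {}
    for rg_index in range(metadata.num_row_groups):
        row_group = metadata.row_group(rg_index)
        for col_index in range(row_group.num_columns):
            chunk = row_group.column(col_index)
            seen.setdefault(chunk.path_in_schema, set()).update(chunk.encodings)
    return {name: tuple(sorted(encs)) for name, encs in seen.items()}


if __name__ == "__main__":
    import sys

    # Usage: python check_encodings.py /path/to/file.parquet
    for name, encs in column_encodings(sys.argv[1]).items():
        label = "dictionary" if uses_dictionary(encs) else "no dictionary"
        print(f"{name}: {encs} ({label})")
```

Running the script against a file written with the settings above should show whether each column chunk still lists a dictionary encoding. Note that the footer also lists encodings used for levels and dictionary pages, so PLAIN and RLE can appear alongside RLE_DICTIONARY for the same column.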

github-actions[bot] commented 3 days ago

We have marked this issue as stale because it has been inactive for 6 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to StarRocks!