pixelsdb / pixels

An efficient storage and compute engine for both on-prem and cloud-native data analytics.
GNU Affero General Public License v3.0
139 stars 37 forks source link

[pixels-executor] improving filtering performance, especially on string columns. #391

Open bianhq opened 1 year ago

bianhq commented 1 year ago

Currently, filtering on string columns is much slower than that in Presto and Trino.

bianhq commented 1 year ago

Filters should be pushed down into the column readers so that for dictionary-encoded string columns, we can filter before decoding.