apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0

[common][core][format] Support using file index result for RowGroup filtering. #4473

Closed. hang8929201 closed this 1 week ago

hang8929201 commented 2 weeks ago

Purpose

Use the file index result (a bitmap) for finer-grained I/O pruning in the reader, so that row groups containing no matching rows can be skipped instead of read.
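To illustrate the idea, here is a minimal sketch and not the code in this PR. It assumes the file index result is a bitmap of matching row positions within the file, and that each row group covers a contiguous range of row positions. `RowGroupFilterSketch`, `RowGroup`, and `selectRowGroups` are hypothetical names, and `java.util.BitSet` stands in for whatever bitmap type the file index actually returns.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.BitSet;
import java.util.List;

public class RowGroupFilterSketch {

    /** Hypothetical row group descriptor: first row position in the file and number of rows. */
    public static final class RowGroup {
        final int firstRow;
        final int rowCount;

        public RowGroup(int firstRow, int rowCount) {
            this.firstRow = firstRow;
            this.rowCount = rowCount;
        }
    }

    /**
     * Returns the indexes of the row groups that contain at least one row
     * position set in the bitmap. All other row groups can be skipped.
     */
    public static List<Integer> selectRowGroups(List<RowGroup> groups, BitSet matchingRows) {
        List<Integer> selected = new ArrayList<>();
        for (int i = 0; i < groups.size(); i++) {
            RowGroup group = groups.get(i);
            int endExclusive = group.firstRow + group.rowCount;
            // First matching row position at or after the start of this row group, or -1.
            int next = matchingRows.nextSetBit(group.firstRow);
            if (next >= 0 && next < endExclusive) {
                selected.add(i);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        // Three row groups of 100 rows each: [0, 100), [100, 200), [200, 300).
        List<RowGroup> groups = Arrays.asList(
                new RowGroup(0, 100), new RowGroup(100, 100), new RowGroup(200, 100));

        // Assumed file index result: only rows 5 and 250 match the predicate.
        BitSet matchingRows = new BitSet();
        matchingRows.set(5);
        matchingRows.set(250);

        // The middle row group has no matching rows, so it is not read.
        System.out.println(selectRowGroups(groups, matchingRows)); // prints [0, 2]
    }
}
```

In this sketch the middle row group is never read. The same bitmap could in principle be applied again inside the selected row groups to skip individual rows.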

Tests

Tested with a query from our real business scenario.

API and Format

Documentation

JingsongLi commented 1 week ago

Left some comments, @hang8929201.