apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.1k stars 834 forks source link

[parquet] Parquet supports column index filter #3610

Closed JingsongLi closed 2 days ago

JingsongLi commented 6 days ago

Purpose

Tests

API and Format

Documentation

JingsongLi commented 5 days ago

https://issues.apache.org/jira/browse/SPARK-34859

JingsongLi commented 5 days ago

Don't not merge this. It seems we need to have more works to do.

JingsongLi commented 2 days ago

Also should consider row position of returned iterator.