apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0

[core] Make default of 'lookup.local-file-type' to sort #4500

Closed: JingsongLi closed this 1 week ago

JingsongLi commented 1 week ago

Purpose

The sort file type has better performance and a better compression ratio than hash, so we should consider making sort the default for 'lookup.local-file-type'.
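
For anyone who wants to pin the file type explicitly rather than rely on the default, here is a minimal sketch. It assumes the accepted values are `sort` and `hash` (inferred from this discussion, not checked against the Paimon docs) and that the option can be set per table with a Flink SQL style `ALTER TABLE ... SET` statement. The helper name `pin_lookup_file_type` and the table name `my_table` are hypothetical.

```python
# Option name taken from the PR title.
OPTION_KEY = "lookup.local-file-type"

# Assumption: these are the two accepted values, as suggested by the thread.
ASSUMED_VALUES = ("sort", "hash")


def pin_lookup_file_type(table: str, file_type: str) -> str:
    """Render a Flink SQL style statement that sets the option on one table.

    Hypothetical helper for illustration; it only builds a string and does
    not talk to Paimon or Flink.
    """
    if file_type not in ASSUMED_VALUES:
        raise ValueError(
            f"unexpected file type {file_type!r}, expected one of {ASSUMED_VALUES}"
        )
    return f"ALTER TABLE {table} SET ('{OPTION_KEY}' = '{file_type}');"


if __name__ == "__main__":
    # Keep the hash file type on a single table, whatever the default is.
    print(pin_lookup_file_type("my_table", "hash"))
```

Setting the option explicitly on tables that matter would also make it easier to compare the two file types side by side before the default changes.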

Tests

API and Format

Documentation

Aitozi commented 1 week ago

I would prefer to see more benchmarks comparing the sort and hash formats before making sort the default.

JingsongLi commented 1 week ago

> I would prefer to see more benchmarks comparing the sort and hash formats before making sort the default.

Thanks! Looking forward to more input from you~