apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.35k stars 928 forks source link

[core] Parquet becomes default file format #3663

Closed JingsongLi closed 3 months ago

JingsongLi commented 3 months ago

Purpose

At present, Parquet is relatively excellent in terms of compression rate and analytical performance. It is time to use parquet by default.

Tests

All tests cover default format.

API and Format

Old tables use orc too, old new table use parquet.

Documentation