apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0

【Hive】Change the table update calling method to alter_table_with_environmen… #4548

Closed GangYang-HX closed 2 days ago

GangYang-HX commented 2 days ago

Purpose

When metastore=hive and a table has many fields (1600+) and many files, Hive executes MetaStoreUtils.updateTableStatsSlow by default to update table statistics. Paimon maintains its own statistics and does not need to rely on Hive for them.

Therefore, this PR provides an option that lets users control whether Hive table statistics are updated.
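
To illustrate the idea, here is a minimal, standard-library-only sketch of how such an option could translate into properties attached to the alter-table request. It is not the code from this PR: the class name, the method name, and the boolean parameter are hypothetical, and the property key "DO_NOT_UPDATE_STATS" reflects my understanding of the flag Hive's metastore checks before recomputing statistics, which should be verified against the Hive version in use.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; none of these names come from Paimon or from this PR.
final class HmsStatsOptionSketch {

    // Assumption: the key Hive's metastore inspects to skip the stats update.
    static final String DO_NOT_UPDATE_STATS = "DO_NOT_UPDATE_STATS";

    /**
     * Builds the properties that would accompany an alter-table call.
     *
     * @param updateHiveStats hypothetical user-facing option; when false,
     *                        the metastore is asked not to recompute stats
     */
    static Map<String, String> alterTableProperties(boolean updateHiveStats) {
        Map<String, String> props = new HashMap<>();
        if (!updateHiveStats) {
            // Only set the flag when the user opts out of Hive statistics.
            props.put(DO_NOT_UPDATE_STATS, "true");
        }
        return props;
    }

    public static void main(String[] args) {
        System.out.println(alterTableProperties(false));
        System.out.println(alterTableProperties(true));
    }
}
```

In a real catalog these properties would be carried to the metastore alongside the altered table definition; leaving the map empty keeps Hive's default behavior.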

Linked issue: Issue-4507

API and Format

org.apache.paimon.hive.HiveCatalog#alterTableToHms