Purpose
In our production environment, we use a Flink session cluster to run OLAP scans over Paimon tables, and we found that JobManager memory usage stays consistently high. We plan to reduce it in two ways:
(1) Remove stats from DataSplit.
(2) During data skipping, trim unused stats from ManifestEntry.
This PR implements (1).
Linked issue: close #xxx
Tests
API and Format
Documentation