Now Spark Bucketed table Compact will plan all files to know the buckets. We can introduce BucketEntry just like PartitionEntry to reduce memory usage.
This PR:
Introduce bucket entries to optimize Spark compact
Add BucketsTable system table to show bucket information.
Purpose
Now Spark Bucketed table Compact will plan all files to know the buckets. We can introduce
BucketEntry
just likePartitionEntry
to reduce memory usage.This PR:
Tests
BucketsTableTest
.API and Format
Documentation