Implements OPTIMIZE tbl FULL, which re-clusters all data in a clustered table.
How was this patch tested?
Added new unit tests.
Does this PR introduce any user-facing changes?
Yes
Previously, a clustered table would not re-cluster data that had already been clustered against different clustering keys. With OPTIMIZE tbl FULL, such data is re-clustered against the new keys.
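A minimal sketch of the scenario this change targets, written as a sequence of SQL statements issued from Python. The table name `tbl` matches the description above; the columns `col_a` and `col_b`, the sample rows, and the helper `run_full_recluster` are hypothetical and only serve the illustration. It assumes a SparkSession already configured with a Delta Lake build that includes this change.

```python
# Hypothetical table layout and data; only OPTIMIZE ... FULL is taken from this PR.
TABLE = "tbl"

STATEMENTS = [
    # Create a clustered Delta table keyed on col_a (hypothetical column).
    f"CREATE TABLE {TABLE} (col_a INT, col_b INT) USING DELTA CLUSTER BY (col_a)",
    f"INSERT INTO {TABLE} VALUES (1, 10), (2, 20), (3, 30)",
    # Cluster the existing data against col_a.
    f"OPTIMIZE {TABLE}",
    # Change the clustering keys; per the description above, a plain OPTIMIZE
    # would not re-cluster data already clustered against the old key.
    f"ALTER TABLE {TABLE} CLUSTER BY (col_b)",
    # Re-cluster all data in the table against the new key.
    f"OPTIMIZE {TABLE} FULL",
]


def run_full_recluster(spark):
    """Run the statements in order on a SparkSession set up for Delta Lake."""
    for stmt in STATEMENTS:
        spark.sql(stmt)
```

Calling `run_full_recluster(spark)` with a Delta-enabled session would execute the steps in order; the final statement is the one introduced here.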