delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
https://delta.io
Apache License 2.0
6.96k stars 1.58k forks source link

[Spark] Extend Delta feature drop safety check to unbackfilled deltas #3028

Closed sumeet-db closed 2 weeks ago

sumeet-db commented 2 weeks ago

Which Delta project/connector is this regarding?

Description

When dropping a feature, DeltaLog now checks both backfilled and unbackfilled deltas for any traces of the feature before confirming it's safe to drop.

However, feature dropping currently does a checkpoint before detecting feature traces in the history, so there are no unbackfilled deltas at that point.

How was this patch tested?

UTs

Does this PR introduce any user-facing changes?

No