delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
https://delta.io
Apache License 2.0
7.22k stars 1.62k forks source link

[Spark] Fix validation of clustering columns #3273

Closed chirag-s-db closed 1 week ago

chirag-s-db commented 1 week ago

Which Delta project/connector is this regarding?

Description

Currently, clustering columns are validating by parsing a list of clustering columns. This is super brittle, and breaks when any clustering column has a comma in the name. Fix that by passing a list of clustering columns directly.

This fix resolves https://github.com/delta-io/delta/issues/3265

How was this patch tested?

Test-only change.

Does this PR introduce any user-facing changes?

No.