delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
https://delta.io
Apache License 2.0
7.62k stars 1.71k forks source link

[Kernel] Assign base row ID to AddFile actions #3894

Open qiyuandong-db opened 9 hours ago

qiyuandong-db commented 9 hours ago

Which Delta project/connector is this regarding?

Description

This PR builds on the base changes which are not yet merged. For changes specific to this PR, please refer to the last commit only.

This PR implements the first part of row tracking support in Delta Kernel, based on the Delta Protocol. Specifically, it includes the following changes:

How was this patch tested?

Added tests in RowTrackingSuite.scala.

Does this PR introduce any user-facing changes?

No.