For equality deletes, we build a `ColumnarBatchReader` for the equality delete filter columns so that we can read their values and determine which rows are deleted. If these filter columns are not among the requested columns, they are considered extra and should be removed before the `ColumnarBatch` is returned to Spark.
For example, suppose the table schema includes C1, C2, C3, C4, and C5, the query is `SELECT C5 FROM table`, and the equality delete filter is on C3 and C4. We read the values of C3 and C4 to identify which rows are deleted, but we do not want to include these values in the `ColumnarBatch` that we return to Spark.
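The example above can be sketched in plain Java. This is only an illustration of the idea, not the reader code in this change: the class `EqualityDeleteSketch`, the methods `findDeletedRows` and `project`, and the column-major `Object[][]` batch are all hypothetical stand-ins, and the sketch assumes the requested columns are read first with the extra filter columns appended after them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Set;

// Hypothetical, simplified model of a batch: columns[col][row].
// Not Iceberg's or Spark's API; for illustration only.
public class EqualityDeleteSketch {

    // A row is deleted when its values in the filter columns match a delete key.
    static boolean[] findDeletedRows(
            Object[][] columns, int[] filterColumns, Set<List<Object>> deleteKeys) {
        int numRows = columns.length == 0 ? 0 : columns[0].length;
        boolean[] deleted = new boolean[numRows];
        for (int row = 0; row < numRows; row++) {
            List<Object> key = new ArrayList<>();
            for (int col : filterColumns) {
                key.add(columns[col][row]);
            }
            deleted[row] = deleteKeys.contains(key);
        }
        return deleted;
    }

    // Keeps only live rows and only the first numRequested columns,
    // so the extra filter columns never reach the caller.
    static Object[][] project(Object[][] columns, boolean[] deleted, int numRequested) {
        int live = 0;
        for (boolean d : deleted) {
            if (!d) {
                live++;
            }
        }
        Object[][] result = new Object[numRequested][live];
        for (int col = 0; col < numRequested; col++) {
            int out = 0;
            for (int row = 0; row < deleted.length; row++) {
                if (!deleted[row]) {
                    result[col][out++] = columns[col][row];
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Object[][] batch = {
            {"a", "b", "c"}, // C5: requested by SELECT C5
            {1, 2, 3},       // C3: extra, read only for the delete filter
            {10, 20, 30}     // C4: extra, read only for the delete filter
        };
        // One equality delete: rows where C3 = 2 and C4 = 20.
        Set<List<Object>> deleteKeys = Set.of(List.<Object>of(2, 20));

        boolean[] deleted = findDeletedRows(batch, new int[] {1, 2}, deleteKeys);
        Object[][] result = project(batch, deleted, 1);
        System.out.println(Arrays.deepToString(result)); // prints [[a, c]]
    }
}
```

The second row is dropped because its (C3, C4) values match the delete key, and the returned batch contains only C5.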