Open DorianZheng opened 1 month ago
Hi @DorianZheng, I’m interested in this task and would appreciate some clarification. Is the focus here on building an optimizer for position delete files, similar to what we did with `IcebergEqualityDeleteRewriteRule`?
Enhancement
https://iceberg.apache.org/spec/#position-delete-files
According to the Iceberg spec, rows in a position delete file must be sorted by `file_path`, then `pos`. This means we don't have to read the entire delete file: we can seek to the first page that contains the referenced data file path and stop as soon as we read a different data file path.
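To make the idea concrete, here is a minimal sketch of the seek-and-stop-early read over sorted delete rows. It is only an illustration, not the engine's actual reader: the in-memory list of `(file_path, pos)` tuples and the function name `deleted_positions` are assumptions, and a real implementation would presumably locate the start page from the delete file's page or row-group statistics rather than from a Python list.

```python
from bisect import bisect_left


def deleted_positions(delete_rows, data_file_path):
    """Return the deleted row positions for one data file.

    delete_rows is a hypothetical in-memory stand-in for a position delete
    file: a list of (file_path, pos) tuples sorted by file_path, then pos.
    """
    # Seek: binary-search for the first row whose file_path >= the target.
    # The 1-tuple (path,) sorts before every (path, pos) with the same path.
    index = bisect_left(delete_rows, (data_file_path,))

    positions = []
    while index < len(delete_rows):
        file_path, pos = delete_rows[index]
        # Stop early: because of the sort order, no later row can match.
        if file_path != data_file_path:
            break
        positions.append(pos)
        index += 1
    return positions


rows = [
    ("a.parquet", 0),
    ("a.parquet", 5),
    ("b.parquet", 2),
    ("b.parquet", 7),
    ("c.parquet", 1),
]
print(deleted_positions(rows, "b.parquet"))
```

Only the rows for `b.parquet` plus one trailing row are visited here; the rows for `a.parquet` are skipped by the seek, and everything after the first `c.parquet` row is never read.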