Open ajantha-bhat opened 4 months ago
Compared to the listFiles API, an inventory listing can be more cost-efficient for remove_orphan_files. We could therefore enhance the procedure/action to accept the inventory information as input.
Reference: https://delta.io/blog/efficient-delta-vacuum/
Query engine: Spark
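To illustrate the idea, here is a minimal sketch in plain Python (not Spark and not the Iceberg API) of how an inventory listing could replace the directory listing: files that appear in the inventory, are not referenced by table metadata, and are older than a cutoff are treated as orphan candidates. The function name `find_orphan_candidates`, the `(path, last_modified)` tuple shape, and the sample paths are all hypothetical and chosen only for illustration.

```python
from datetime import datetime, timedelta, timezone


def find_orphan_candidates(inventory, referenced_paths, older_than):
    """Return paths from the inventory that look like orphan files.

    inventory        -- iterable of (path, last_modified) tuples, e.g. parsed
                        from a storage inventory report (hypothetical shape)
    referenced_paths -- iterable of paths still referenced by table metadata
    older_than       -- datetime cutoff; newer files are skipped so that
                        in-progress writes are not flagged
    """
    referenced = set(referenced_paths)
    return sorted(
        path
        for path, last_modified in inventory
        if path not in referenced and last_modified < older_than
    )


# Illustrative data only; none of these paths come from a real table.
now = datetime(2024, 1, 10, tzinfo=timezone.utc)
inventory = [
    ("s3://bucket/tbl/data/a.parquet", now - timedelta(days=9)),
    ("s3://bucket/tbl/data/b.parquet", now - timedelta(days=9)),
    ("s3://bucket/tbl/data/c.parquet", now - timedelta(hours=1)),
]
referenced = ["s3://bucket/tbl/data/a.parquet"]

candidates = find_orphan_candidates(inventory, referenced, now - timedelta(days=3))
print(candidates)
```

In this sketch only `b.parquet` is reported: `a.parquet` is still referenced, and `c.parquet` is newer than the cutoff. The potential cost saving comes from reading the inventory report once instead of issuing list calls against the object store.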
Hi @ajantha-bhat - Don't we already support this after https://github.com/apache/iceberg/pull/4503?
@flyrain did some analysis on this internally. He may have some ideas here.