Incremental read support for Iceberg tables

trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Apache License 2.0


Trino currently supports reading the data that belongs to a particular Iceberg snapshot. Incremental read support would make it possible to read only the data that changed between two snapshots. I'm not sure of the Trino convention, but it could look something like this: select count(*) from iceberg.testdb."table@{S1,S2}", which would output only the rows inserted between S1 (exclusive) and S2 (inclusive).
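To make the proposed range semantics concrete, here is a minimal toy model in Python. It is only a sketch: the `Snapshot` class and `incremental_rows` function are hypothetical names made up for illustration, not part of Trino or Iceberg, and it assumes an append-only, linear snapshot history.

```python
from dataclasses import dataclass, field


@dataclass
class Snapshot:
    # Hypothetical stand-in for an Iceberg snapshot: an id plus the rows it appended.
    snapshot_id: str
    appended_rows: list = field(default_factory=list)


def incremental_rows(history, start_id, end_id):
    """Return rows appended after start_id (exclusive) up to end_id (inclusive).

    `history` is assumed to be a linear, oldest-first list of append-only snapshots.
    """
    ids = [s.snapshot_id for s in history]
    start, end = ids.index(start_id), ids.index(end_id)
    if start > end:
        raise ValueError("start snapshot must not come after end snapshot")
    rows = []
    # Skip the start snapshot itself, include the end snapshot.
    for snap in history[start + 1 : end + 1]:
        rows.extend(snap.appended_rows)
    return rows


history = [
    Snapshot("S0", ["a"]),
    Snapshot("S1", ["b", "c"]),
    Snapshot("S2", ["d"]),
    Snapshot("S3", ["e", "f"]),
]

# Rows inserted between S1 (exclusive) and S3 (inclusive): those of S2 and S3.
print(len(incremental_rows(history, "S1", "S3")))  # prints 3
```

With this model, a range of S1 to S1 returns nothing, since the start snapshot is excluded. Deletes and overwrites are deliberately out of scope here.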

Spark support:

Appends are supported as of https://github.com/apache/iceberg/pull/315.
Delete/overwrite mutations are a work in progress: https://github.com/apache/iceberg/pull/2782

Incremental read support for Iceberg tables #8780