prestodb / presto

The official home of the Presto distributed SQL query engine for big data
http://prestodb.io
Apache License 2.0
16.07k stars 5.38k forks source link

Implement writes for Iceberg tables with Prestissimo #21355

Open ZacBlanco opened 1 year ago

ZacBlanco commented 1 year ago

Recently, I've been working on some bits for the iceberg connector that will support sampled statistics for the optimizer (#20937, #21296, #20993, and more). However, for sample maintenance to occur, we'll also need to write out those samples to a table in the filesystem somewhere.

Currently, in our java runtime implementation, the tables are written out to separate Iceberg tables at a known subdirectory. It would be ideal if we could keep the implementation the same between Java and C++ runtimes. This would require implementing write support for Iceberg tables in C++/Prestissimo.

Presto Component, Service, or Connector

Prestissimo/Iceberg

ZacBlanco commented 1 year ago

cc: @aditi-pandit