trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
https://trino.io
Apache License 2.0
10.49k stars 3.02k forks source link

Add missing columns to Iceberg system tables #24101

Open ebyhr opened 1 week ago

ebyhr commented 1 week ago

$properties table

This table is specific to Trino.

$history table

Trino supports all columns in Spark.

$metadata_log_entries table

Trino supports all columns in Spark.

$snapshots table

Trino supports all columns in Spark.

$manifests table

The column name for partitions is different. Trino uses partitions and Spark uses partition_summaries.

$partitions table

spec_id, position_delete_record_count, position_delete_file_count, equality_delete_record_count, equality_delete_file_count, last_updated_at(μs), last_updated_snapshot_id columns are missing. The column name for total size is different. Trino uses total_size and Spark uses total_data_file_size_in_bytes.

$files table

spec_id, partition, sort_order_id, readable_metrics columns are missing.

$refs table

Trino supports all columns in Spark.

References: