Open findepi opened 2 years ago
Iceberg column_sizes is
column_sizes
Map from column id to the total size on disk of all regions that store the column. Does not include bytes necessary to read other columns, like footers.
(https://iceberg.apache.org/spec/)
We use this field to fill io.trino.spi.statistics.ColumnStatistics#dataSize in Iceberg, but this should be uncompressed data size.
io.trino.spi.statistics.ColumnStatistics#dataSize
L
ColumnStatistics#dataSize
fixed(L)
cc @danielcweeks @homar @ebyhr @alexjo2144
@findepi I think short term is done (https://github.com/trinodb/trino/pull/15186)
Iceberg
column_sizes
is(https://iceberg.apache.org/spec/)
We use this field to fill
io.trino.spi.statistics.ColumnStatistics#dataSize
in Iceberg, but this should be uncompressed data size.L
asColumnStatistics#dataSize
forfixed(L)
columns