apache / parquet-format

Apache Parquet Format
https://parquet.apache.org/
Apache License 2.0
1.69k stars 422 forks source link

PARQUET-2139: Clarify ColumnChunk::file_offset is not used #439

Closed alamb closed 1 week ago

alamb commented 1 week ago

See also mailint list discussion: https://lists.apache.org/thread/r6r2cvzrdoorq6h6gqwh0b1hbfbhxv29

This field is not written correctly in the Java implementation and it not read in other implementations (like rust). Let's clarify this in the format docs

Jira/GH

Related to https://github.com/apache/parquet-java/issues/2678

Commits

alamb commented 1 week ago

Superseded by https://github.com/apache/parquet-format/pull/440