Open alamb opened 4 days ago
@etseidl predicted this error in https://github.com/apache/arrow-rs/pull/6081/files#r1774020124
I wonder if the metadata writer needs to modify the page index offsets/lengths in the ColumnMetaData if the indexes are not present in the ParquetMetaData. Then again, I could see wanting to preserve the page index offsets of the original file if you only want to save the footer metadata externally...perhaps an option on the metadata writer to preserve page index offsets if desired?
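To make the option suggested above concrete, here is a minimal, self-contained sketch. It does not use the real arrow-rs types: `ColumnChunkMeta`, `PageIndexPolicy`, and `prepare_for_write` are hypothetical names invented for illustration, and the actual `ParquetMetaDataWriter` may need a different approach.

```rust
/// Hypothetical, simplified stand-in for the page index location fields that
/// are stored per column chunk. This is NOT the real arrow-rs type.
#[derive(Debug, Clone, PartialEq, Default)]
struct ColumnChunkMeta {
    column_index_offset: Option<i64>,
    column_index_length: Option<i32>,
    offset_index_offset: Option<i64>,
    offset_index_length: Option<i32>,
}

/// Hypothetical writer option (an assumption, not an existing API).
#[derive(Debug, Clone, Copy, PartialEq)]
enum PageIndexPolicy {
    /// Keep the offsets from the original file, e.g. when only the footer
    /// is stored externally and the indexes still live in the data file.
    Preserve,
    /// Drop the offsets when the indexes were never loaded, so a reader of
    /// the written metadata does not follow stale locations.
    ClearIfMissing,
}

/// Returns the column chunk metadata that the writer would serialize.
/// When the indexes are loaded, the offsets are left untouched in this sketch.
fn prepare_for_write(
    chunk: &ColumnChunkMeta,
    indexes_loaded: bool,
    policy: PageIndexPolicy,
) -> ColumnChunkMeta {
    let mut out = chunk.clone();
    if !indexes_loaded && policy == PageIndexPolicy::ClearIfMissing {
        out.column_index_offset = None;
        out.column_index_length = None;
        out.offset_index_offset = None;
        out.offset_index_length = None;
    }
    out
}

fn main() {
    let chunk = ColumnChunkMeta {
        column_index_offset: Some(100),
        column_index_length: Some(20),
        offset_index_offset: Some(120),
        offset_index_length: Some(10),
    };
    // Indexes were not loaded: clearing avoids pointing at bytes that the
    // writer never emitted.
    let cleared = prepare_for_write(&chunk, false, PageIndexPolicy::ClearIfMissing);
    println!("{:?}", cleared);
    // Preserving keeps the original file's locations.
    let kept = prepare_for_write(&chunk, false, PageIndexPolicy::Preserve);
    println!("{:?}", kept);
}
```

With `ClearIfMissing`, a reader of the rewritten metadata would see no page index locations and so would not try to decode them; with `Preserve`, the offsets only make sense relative to the original file.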
Describe the bug If the `ParquetMetaDataReader` tries to read metadata written by `ParquetMetaDataWriter` without first loading the page indexes, you get an error like "missing required field ColumnIndex.null_pages".
Note this depends on https://github.com/apache/arrow-rs/pull/6463
To Reproduce The full reproducer is in https://github.com/apache/arrow-rs/pull/6463. Here is the relevant piece
Expected behavior The reader should not error
I am not sure if the right fix is to
Additional context @etseidl has added the APIs in https://github.com/apache/arrow-rs/pull/6431