Open alamb opened 2 days ago
@alamb are you working on this or would you like me to take it?
@alamb are you working on this or would you like me to take it?
I am not planning to work on this at this time. If you had time to look at it that would be great.
THank you 🙏
take
Is your feature request related to a problem or challenge? Please describe what you are trying to do. As @etseidl pointed out in https://github.com/apache/arrow-rs/pull/6466/files#r1778966728
We can use the new ParquetMetaDataLoader API to read the page indexes in more efficiently (fewer IOs for example)
However, when I tried to implement it, we caught what appears to be a subtle bug -- specifically that the predicates would have been ignored: https://github.com/apache/arrow-rs/pull/6466/files#r1783526090 -- no tests failed.
Describe the solution you'd like
I would like to:
SerializedFileReader::new_with_options
, and clean up the code to use the new ParquetMetaDataReaderDescribe alternatives you've considered leave as is
Additional context