Open kazuyukitanimura opened 2 weeks ago
There are some tests in Spark 4.0 that uses parquet.writer.version=v2 (ParquetTypeWideningSuite).
parquet.writer.version=v2
ParquetTypeWideningSuite
The V2 write writes with delta encoding. Comet currently cannot read such files
No response
IIRC, the vectorized versions of these encodings in Spark did not improve performance much over the row based implementation in the parquet library
What is the problem the feature request solves?
There are some tests in Spark 4.0 that uses
parquet.writer.version=v2
(ParquetTypeWideningSuite
).The V2 write writes with delta encoding. Comet currently cannot read such files
Describe the potential solution
No response
Additional context
No response