import org.apache.hadoop.conf.Configuration

val hadoopConfig = new Configuration() // simplified for issue
hadoopConfig.set("fs.s3a.impl.disable.cache", "true")
hadoopConfig.set("fs.s3a.path.style.access", "true")
// other settings like secrets and url
ParquetReader
.projectedGeneric(querySchema)
.options(Options(hadoopConf = hadoopConfig))
.read(Path("s3a://parquet-driver-spec/multipart_parquet/a1"))
I guess I misunderstood how partitions should be structured. The folder names in my example are incorrect; they should be b=1 and b=2. With that change, everything works.
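For anyone who lands here with the same symptom: as far as I understand, partition directories are expected to follow the Hive-style name=value convention (here b=1 and b=2). The snippet below is only a rough sketch of that naming rule, using plain Scala with no parquet4s calls. `PartitionLayout` and `partitionValues` are hypothetical names made up for illustration, and `not-a-partition` is an invented folder name, not the one from my bucket.

```scala
// Hypothetical helper, NOT part of parquet4s: illustrates the Hive-style
// convention where each partition directory is named <column>=<value>.
object PartitionLayout {
  // Returns the (column, value) pairs found in the directory segments of a path.
  // Segments without a name=value shape (e.g. the part file itself) are skipped.
  def partitionValues(relativePath: String): List[(String, String)] =
    relativePath
      .split('/')
      .toList
      .map(_.split("=", 2))
      .collect {
        case Array(name, value) if name.nonEmpty && value.nonEmpty => name -> value
      }
}

// A folder named b=1 yields one partition column "b" with value "1".
val ok = PartitionLayout.partitionValues("b=1/part-0000")

// A folder without the name=value shape yields no partition information.
val none = PartitionLayout.partitionValues("not-a-partition/part-0000")
```

This only demonstrates the naming rule; how parquet4s itself discovers and filters partitions is not shown here.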
parquet4s version 2.18.0
I have 3 Parquet files (part-0000, part-0001, part-0002) inside a directory in an S3 bucket.
With this url
s3a://parquet-driver-spec/multipart_parquet/a1
parquet4s reads the partitions and returns all rows. But if the files are in this structure
parquet4s returns 0 rows
code: