mjakubowski84 / parquet4s

Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster.
https://mjakubowski84.github.io/parquet4s/
MIT License
283 stars 65 forks

Feat(core): make path filter configurable #329

Open i10416 opened 11 months ago

i10416 commented 11 months ago

This change adds a pathFilter option to the ParquetReader builder interface, because in some situations users need to configure the path filter predicate (e.g. when they use a _ prefix for partition columns).

Currently, there seems to be no option to change the default path filter (org.apache.parquet.hadoop.util.HiddenFileFilter).
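
To illustrate the problem, here is a minimal, dependency-free sketch of the two filtering rules. It does not use the parquet4s or Hadoop APIs: NameFilter, hiddenFileFilter and partitionAwareFilter are hypothetical names introduced only for this example, and the default rule is modelled on my understanding that HiddenFileFilter rejects any path whose name starts with _ or .

```scala
// Hypothetical model of a path filter: a predicate on the last path segment.
// This is NOT the parquet4s or Hadoop API, only an illustration of the rules.
object PathFilterSketch {
  type NameFilter = String => Boolean

  // Assumed behaviour of the default filter: reject names starting with '_' or '.'.
  val hiddenFileFilter: NameFilter =
    name => !(name.startsWith("_") || name.startsWith("."))

  // A custom rule: additionally accept "_key=value" partition directories,
  // while still rejecting other hidden entries such as _SUCCESS or .crc files.
  val partitionAwareFilter: NameFilter =
    name => hiddenFileFilter(name) || (name.startsWith("_") && name.contains("="))

  def main(args: Array[String]): Unit = {
    val names = List(
      "part-0000.parquet",
      "_SUCCESS",
      ".part-0000.parquet.crc",
      "_year=2023"
    )
    // The default rule drops the "_year=2023" partition directory.
    println(names.filter(hiddenFileFilter))
    // The custom rule keeps it.
    println(names.filter(partitionAwareFilter))
  }
}
```

With the default rule, a partition directory such as _year=2023 is skipped along with genuinely hidden files, which is why a configurable filter is needed for datasets that use a _ prefix for partition columns.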