Open v0y4g3r opened 1 year ago
Performance
ParquetWriter already supports bloom filter encoding, but we have to apply query clauses to bloom filters during table scan.
ParquetWriter
Once we can build external index file, we may also switch to xor filter and it's rust implementation for better performance.
@v0y4g3r Any progress?
@v0y4g3r What's the plan for this issue? I am not sure if we still need it.
IMO, we should do some benchmarks to compare with the inverted index later as parquet already supports it.
What type of enhancement is this?
Performance
What does the enhancement do?
ParquetWriter
already supports bloom filter encoding, but we have to apply query clauses to bloom filters during table scan.Once we can build external index file, we may also switch to xor filter and it's rust implementation for better performance.