apache / parquet-java

Apache Parquet Java
https://parquet.apache.org/
Apache License 2.0
2.54k stars 1.39k forks source link

Column bloom filter: Show bloom filters in tools #2693

Open asfimport opened 2 years ago

asfimport commented 2 years ago

command result as follow:

parquet-tools bloom-filter BloomFilter.snappy.parquet row-group 0: bloom filter for column id: NONE

bloom filter for column uuid: Hash strategy: block Algorithm: block Compression: uncompressed Bitset size: 1048576

Reporter: BingKun Pan / @panbingkun

Note: This issue was originally created as PARQUET-2156. Please see the migration documentation for further details.

arthurpassos commented 1 month ago

I am also interested on this

wgtmac commented 1 month ago

@arthurpassos Good to see you again! Looking forward to your PR.

arthurpassos commented 1 month ago

@arthurpassos Good to see you again! Looking forward to your PR.

I still lack knowledge on parquet internals and the ecosystem to implement such features, but hopefully I'll be able to contribute some day

wgtmac commented 1 month ago

NP. Take your time