apache / parquet-java

Apache Parquet Java
https://parquet.apache.org/
Apache License 2.0
2.48k stars 1.37k forks source link

Parquet check-stats command only supports checking BINARY type columns. #1382

Open zhuyaogai opened 1 week ago

zhuyaogai commented 1 week ago

Hi, community, could you tell me why parquet check-stats command only supports checking BINARY type columns?

https://github.com/apache/parquet-java/blob/5ac50884e9ee27a1ce3ee18cc5ed4422147e5387/parquet-cli/src/main/java/org/apache/parquet/cli/commands/CheckParquet251Command.java#L104

Could we add some optional parameters so that it can detect all types of columns? I just want to use this tool to know whether there is any data corruption in the parquet file(not just for BINARY type columns).