apache / parquet-java

Apache Parquet Java
https://parquet.apache.org/
Apache License 2.0

GH-3055: Disable column statistics for all columns by configuration #3056

Closed wgtmac closed 1 week ago

wgtmac commented 1 week ago

Rationale for this change

https://github.com/apache/parquet-java/pull/2989 added a per-column config to disable column statistics. This PR adds a global flag to disable column statistics for all columns.

What changes are included in this PR?

Add a new flag, parquet.column.statistics.enabled, that enables or disables column statistics for all columns.
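To illustrate how a global flag could interact with the per-column config, here is a minimal sketch in plain Java. It is not the parquet-java implementation: the class StatisticsFlagSketch, its methods, the "#<column path>" key suffix for per-column entries, the default of "enabled", and the rule that a per-column entry overrides the global flag are all assumptions made for illustration. Only the flag name parquet.column.statistics.enabled comes from this PR.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch, NOT the parquet-java implementation.
class StatisticsFlagSketch {
    // Flag name taken from the PR description.
    static final String STATISTICS_ENABLED = "parquet.column.statistics.enabled";

    // Assumption: a per-column entry is keyed as "<flag>#<column path>".
    static String columnKey(String columnPath) {
        return STATISTICS_ENABLED + "#" + columnPath;
    }

    // Assumptions: statistics are enabled when nothing is configured,
    // and a per-column entry takes precedence over the global flag.
    static boolean isStatisticsEnabled(Map<String, String> conf, String columnPath) {
        String perColumn = conf.get(columnKey(columnPath));
        if (perColumn != null) {
            return Boolean.parseBoolean(perColumn);
        }
        return Boolean.parseBoolean(conf.getOrDefault(STATISTICS_ENABLED, "true"));
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put(STATISTICS_ENABLED, "false");   // disable statistics globally
        conf.put(columnKey("user.id"), "true");  // re-enable for one column

        System.out.println("user.id: " + isStatisticsEnabled(conf, "user.id"));
        System.out.println("payload: " + isStatisticsEnabled(conf, "payload"));
    }
}
```

In a real job the same key would be set on the writer's configuration rather than on a plain map; the map here only keeps the sketch free of external dependencies.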

Are these changes tested?

Yes, a new test case has been added to TestStatistics.java.

Are there any user-facing changes?

Yes, the new flag parquet.column.statistics.enabled is exposed to users.

Closes #3055

wgtmac commented 1 week ago

Thanks @pan3793 @ConeyLiu @Fokko!