apache / parquet-java

Apache Parquet Java
https://parquet.apache.org/
Apache License 2.0
2.65k stars 1.41k forks source link

GH-3059: Add configuration to disable size statistics #3060

Closed wgtmac closed 4 days ago

wgtmac commented 1 week ago

Rationale for this change

Now size statistics are enabled globally. We need to better control the behavior when it is not necessary.

What changes are included in this PR?

Add a new flag parquet.size.statistics.enabled to control whether to enable size statistics globally. It also supports a per-column control.

Are these changes tested?

Yes, added a test case in TestParquetWriter.java.

Are there any user-facing changes?

Yes, a new flag parquet.size.statistics.enabled has been added.

Closes #3059

wgtmac commented 1 week ago

Could you please take a look? Thanks! @gszadovszky @Fokko