apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.13k stars 842 forks source link

[core] Fix FieldCountAgg null value init process #3536

Closed xuzifu666 closed 2 weeks ago

xuzifu666 commented 2 weeks ago

Currently we had FieldAggregator to compute all field cout,but not support filter null field count,this pr is aim to support it.

Linked issue: https://github.com/apache/paimon/issues/3537

Null value is filter count by agg operation

API and Format

Documentation

JingsongLi commented 2 weeks ago

actually FieldCountAgg is the FieldNonNullCountAgg.

xuzifu666 commented 2 weeks ago

actually FieldCountAgg is the FieldNonNullCountAgg.

Add the logic to the FieldCountAgg and remove FieldNonNullCountAgg.