Kyligence / ClickHouse

ClickHouse® is a free analytics DBMS for big data
https://clickhouse.com
Apache License 2.0

[405] Eliminate data skew in hash shuffle #406

Closed lgbo-ustc closed 1 year ago

lgbo-ustc commented 1 year ago

Changelog category (leave one):

Since cityHash64 returns NULL if any argument is NULL, rows with a NULL hash were all marked as belonging to partition 0, which could cause data skew. This PR removes the code that marks NULL rows as belonging to partition 0; the result is still correct.
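
To illustrate the skew described above, here is a minimal Python sketch. It is not the code from this patch. `stable_hash` is a hypothetical stand-in for cityHash64 (built on `hashlib.blake2b`), and `partition_null_aware` is an assumed alternative that treats NULL as an ordinary value when hashing; the function names, partition count, and sample rows are all made up for the example.

```python
import hashlib
from collections import Counter

NUM_PARTITIONS = 4  # illustrative value, not taken from the patch


def stable_hash(values):
    """Deterministic 64-bit hash of a tuple; a stand-in for cityHash64, not the real function."""
    data = repr(values).encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "little")


def partition_null_to_zero(row):
    """Mimic a hash that is NULL when any argument is NULL: such rows all land in partition 0."""
    if any(value is None for value in row):
        return 0
    return stable_hash(row) % NUM_PARTITIONS


def partition_null_aware(row):
    """Assumed alternative: hash the whole key, treating None as an ordinary value."""
    return stable_hash(row) % NUM_PARTITIONS


# Every row has a NULL in its second key column.
rows = [(i, None) for i in range(1000)]

skewed = Counter(partition_null_to_zero(row) for row in rows)
spread = Counter(partition_null_aware(row) for row in rows)

print("NULL rows forced to partition 0:", dict(skewed))
print("NULL treated as a regular value:", dict(spread))
```

With the first function all 1000 rows end up in partition 0, while the second spreads them across the partitions. Rows with identical keys still map to the same partition in both variants, which is the property a hash shuffle has to preserve.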

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

...

Documentation entry for user-facing changes

Information about CI checks: https://clickhouse.com/docs/en/development/continuous-integration/

close #405

kyligence-git commented 1 year ago

Can one of the admins verify this patch?

lgbo-ustc commented 1 year ago

test this please
