Closed xudong963 closed 2 days ago
I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/
Note: tpcds 1GB. Local machine M1 mac, 8c 32GB
This PR adds a fast path for cross join when build side only has a row, which matches tpcds q23's pattern(build side is from a max/sum agg).
By the fast path, we can avoid memory reallocation. Then reduce cache miss and improve the ability of SIMD for next operators.
Before: 26s
Now: 0.4s
This change isโ
I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/
Summary
Note: tpcds 1GB. Local machine M1 mac, 8c 32GB
This PR adds a fast path for cross join when build side only has a row, which matches tpcds q23's pattern(build side is from a max/sum agg).
By the fast path, we can avoid memory reallocation. Then reduce cache miss and improve the ability of SIMD for next operators.
Before: 26s
Now: 0.4s
Tests
Type of change
This change isโ