Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.
7.04k
stars
579
forks
source link
Investigate what is the actual bottleneck in hash agg processing for dirty groups #18748
Open
kwannoel opened 1 month ago
It doesn't seem to be heap or cpu bottleneck. So what is the actual bottleneck, is it IO cost, due to lookups? If so we need a metric for it.
Or is it skew? because in some scenarios, the workload peaks at 1600%. But we have 32 cores.
Needs further investigation.