Open kwannoel opened 6 months ago
This issue has been open for 60 days with no activity.
If you think it is still relevant today, and needs to be done in the near future, you can comment to update the status, or just manually remove the no-issue-activity
label.
You can also confidently close this issue as not planned to keep our backlog clean. Don't worry if you think the issue is still valuable to continue in the future. It's searchable and can be reopened when it's time. 😄
One thing which @BugenZhao found is the old case where rpc connection can cause deadlock under high join amplification. There's a workaround provided:
Btw seems like many real-world usecases have inevitable join amplification. So observability alone is insufficient. We need some solution which can enable horizontal scaling to process the join amplification. i.e. when there's high join amp, increasing compute can help to quickly resolve it.
Additionally speeding up the join itself can also greatly help.
1M join amp is a good target to see if our system can reasonably handle it.
In streaming workload, high join amplification can lead to barrier latency spike.
So far our approaches to mitigate these have been more indirect, e.g. decoupling sink, adaptive rate limit. It can isolate the slowness to the specific stream job.
I think we should also explore optimizing join itself.