matrixorigin / matrixone

Hyperconverged cloud-edge native database
https://docs.matrixorigin.cn/en
Apache License 2.0

[Tech Request]: Discrepancy in Commit Wait Times Across tn and cn Nodes #18562

Open sukki37 opened 1 week ago

sukki37 commented 1 week ago

Is there an existing issue for the same tech request?

Does this tech request not affect user experience?

What would you like to be added ?

During a recent performance profiling session, we identified significant discrepancies in commit wait times across different nodes in the commit chain. Specifically:

Request:

Why is this needed ?

The 15-second difference between the logservice client call and the overall tn commit time, as well as the additional latency in the cn commit, suggests there are other inefficiencies in the commit chain that need to be addressed.

Additional information

[Figure: off-CPU time flame graph (out.svg)]

tool: https://github.com/iovisor/bcc https://github.com/brendangregg/FlameGraph

command:

/bcc/tools/offcputime.py -df -p "$(pgrep -d ',' test)" 30 > out.stacks
./FlameGraph/flamegraph.pl --color=io --title="Off-CPU Time Flame Graph" --countname=us < out.stacks > out.svg
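
To compare wait times without reading them off the SVG, the folded stacks in out.stacks can be summed per frame. The sketch below assumes the usual folded format written by `offcputime.py -f` (semicolon-separated frames, then a space and the blocked time in microseconds). The frame names in the sample are hypothetical placeholders, not real MatrixOne symbols.

```python
from collections import Counter


def total_us_by_frame(lines, needle):
    """Sum off-CPU microseconds per frame whose name contains `needle`.

    Each input line is assumed to look like: frame1;frame2;...;frameN <count>
    """
    totals = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        # The count is the last space-separated field; frames may contain spaces.
        stack, _, count = line.rpartition(" ")
        if not count.isdigit():
            continue  # skip anything that is not a folded-stack line
        # Use a set so a recursive frame is counted once per stack.
        for frame in set(stack.split(";")):
            if needle in frame:
                totals[frame] += int(count)
    return totals


# Hypothetical sample lines, for illustration only.
sample = [
    "test;main;txn.Commit;logservice.Append;-;finish_task_switch 1200",
    "test;main;txn.Commit;logtail.Apply;-;finish_task_switch 800",
    "test;main;txn.Commit;logservice.Append;-;finish_task_switch 300",
]

if __name__ == "__main__":
    for frame, us in total_us_by_frame(sample, "Commit").most_common():
        print(f"{frame}: {us} us")
```

For a real profile, pass `open("out.stacks")` instead of `sample`, with a substring of the commit function of interest as `needle`.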

XuPeng-SH commented 1 week ago

As discussed in the last meeting, applying the logtail serially may cause latency to accumulate.
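
A toy model of that effect, not MatrixOne's actual logtail code: if entries are applied one at a time and each apply takes longer than the interval between arrivals, every entry waits for all earlier ones, so the wait grows with queue position. The function name and the numbers are assumptions chosen for illustration.

```python
def serial_apply_waits(arrivals, apply_cost):
    """Return how long each entry waits before its apply starts.

    arrivals:   arrival times of the entries, in ascending order
    apply_cost: time to apply one entry (entries are applied strictly in order)
    """
    waits = []
    free_at = 0.0  # time at which the single applier becomes idle
    for t in arrivals:
        start = max(t, free_at)  # cannot start before the previous apply ends
        waits.append(start - t)
        free_at = start + apply_cost
    return waits


if __name__ == "__main__":
    # One entry per time unit, two units per apply: waits grow 0, 1, 2, 3.
    print(serial_apply_waits([0, 1, 2, 3], 2))
    # Apply faster than arrival: no wait accumulates.
    print(serial_apply_waits([0, 1, 2, 3], 0.5))
```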

XuPeng-SH commented 7 hours ago

We are currently working on higher-priority items.