matrixorigin / matrixone

Hyperconverged cloud-edge native database
https://docs.matrixorigin.cn/en
Apache License 2.0
[Tech Request]: Discrepancy in Commit Wait Times Across tn and cn Nodes #18562

Open sukki37 opened 2 months ago

sukki37 commented 2 months ago

Is there an existing issue for the same tech request?

Does this tech request not affect user experience?

What would you like to be added ?

During a recent performance profiling session, we identified significant discrepancies in commit wait times across different nodes in the commit chain. Specifically:

Request:

Why is this needed ?

The 15-second difference between the logservice client call and the overall tn commit time, as well as the additional latency in the cn commit, suggests there are other inefficiencies in the commit chain that need to be addressed.

Additional information

[Image: out.svg (off-CPU time flame graph)]

tool: https://github.com/iovisor/bcc https://github.com/brendangregg/FlameGraph

command:

/bcc/tools/offcputime.py -df -p "$(pgrep -d ',' test)" 30 > out.stacks
./FlameGraph/flamegraph.pl --color=io --title="Off-CPU Time Flame Graph" --countname=us < out.stacks > out.svg
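
To compare waits along the commit chain without reading them off the SVG, the folded stacks in out.stacks can also be summed directly. The following is a minimal sketch, assuming each line has the form `frame;frame;... <microseconds>` as written by `offcputime.py -f`. The function name `offcpu_by_frame` and the frame names in the usage note are hypothetical, not taken from the MatrixOne code base.

```python
from collections import defaultdict
import sys


def offcpu_by_frame(lines, needle):
    """Sum off-CPU microseconds per frame whose name contains `needle`.

    Assumes folded-stack lines: 'frame;frame;... <count>'.
    """
    totals = defaultdict(int)
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        # The count is the last space-separated field; frames may contain spaces.
        stack, _, count = line.rpartition(" ")
        if not count.isdigit():
            continue
        # Use a set so a recursive frame is counted once per stack.
        for frame in set(stack.split(";")):
            if needle in frame:
                totals[frame] += int(count)
    return dict(totals)


if __name__ == "__main__":
    # Usage: python sum_offcpu.py out.stacks Commit
    path, needle = sys.argv[1], sys.argv[2]
    with open(path) as f:
        totals = offcpu_by_frame(f, needle)
    for frame, us in sorted(totals.items(), key=lambda kv: -kv[1]):
        print(f"{us / 1e6:10.3f} s  {frame}")
```

Running it with a substring such as `Commit` lists every matching frame with its total blocked time in seconds, which makes the gap between two layers of the chain easier to quantify.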

XuPeng-SH commented 2 months ago

As discussed in the last meeting, applying logtail serially may cause latency to accumulate.
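
A toy model of why a serial apply loop accumulates latency: if entries arrive faster than one entry can be applied, each entry also waits for every entry ahead of it. This is only an illustrative sketch under assumed numbers (fixed arrival interval, fixed apply cost); `serial_apply_waits` is a hypothetical name and does not reflect the actual logtail implementation.

```python
def serial_apply_waits(arrivals_ms, apply_cost_ms):
    """Return, per entry, the time from arrival to completion of its apply
    when entries are applied strictly one at a time in arrival order."""
    waits = []
    free_at = 0.0  # time at which the single applier becomes idle
    for arrival in arrivals_ms:
        start = max(arrival, free_at)      # wait for the previous apply to finish
        free_at = start + apply_cost_ms
        waits.append(free_at - arrival)
    return waits


if __name__ == "__main__":
    arrivals = [i * 10.0 for i in range(5)]  # assumed: one entry every 10 ms
    print(serial_apply_waits(arrivals, apply_cost_ms=15.0))
    # prints [15.0, 20.0, 25.0, 30.0, 35.0]
```

With a 15 ms apply cost and a 10 ms arrival interval, every entry waits 5 ms longer than the previous one. With an apply cost below the arrival interval, the wait stays flat.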

XuPeng-SH commented 2 days ago

This will be handled in the next iteration.