Closed wuhuua closed 1 week ago
Hi @wuhuua, please add branch-* labels to identify which branch(es) this C-bug affects.
:owl: Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.
Hello, I am Blathers. I am here to help you get the issue triaged.
Hoot - a bug! Though bugs are the bane of my existence, rest assured the wretched thing will get the best of care here.
I have CC'd a few people who may be able to assist you:
If we have not gotten back to your issue within a few business days, you can try the following:
:owl: Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.
Hi @wuhuua, thanks for the report. To help me investigate, it would be helpful if you could gather some information for me:
You can send the results to me privately via this link.
tsdump sent, with a detailed description
It looks like CPU is intermittently reaching ~100% on individual nodes. KV requests to a high-utilization node are delayed, and since we're bulk-inserting, a given SQL statement is pretty likely to hit the slow node.
A good next step would be to try and figure out why CPU utilization is so high on certain nodes. We should be collecting CPU profiles automatically, which you can access via the logging directory cockroach-data/logs/pprof_dump
. Let's grab the profiles for several different nodes to make sure we capture an interesting period.
I also have a few questions about your hardware/workload:
@wuhuua I'm going to close this issue for now. Please feel free to open a new issue (or reply to this issue) if you have further questions.
Describe the problem I deployed a cluster using self-compiled CockroachDB v24.2.0-dev which has 50 nodes. When trying to start a write batch to the cluster, I found that CockroachDB may experience sudden high latency across the entire cluster during write operations.
There's no network problem during my writing process, so I wonder why these sudden high latency occurs.
My steps to run the cluster and write process:
/symmetricdbserver start --certs-dir=/local/certs --store=/symmetricdb/symmetricdb-data --listen-addr {{env "attr.unique.network.ip-address"}}:26259 --http-addr 0.0.0.0:8081 --join 0.symmetricdb-cluster-secure.service.consul:26259
Environment:
Jira issue: CRDB-41447