Closed stevendanna closed 1 year ago
Bisecting just the workload binary suggest that
https://github.com/cockroachdb/cockroach/pull/68608
accounts for about 4~6k of the drop. This merged the day after the roachprod fix.
I wonder if this is going to have something to do with statement caching and increased mutex contention. Just throwing things out there.
cc @rafiss
https://github.com/cockroachdb/cockroach/pull/68608 contains a change that makes all statements get prepared and cached no matter what.
This "prepare and cache all statements" behavior was disabled in https://github.com/cockroachdb/cockroach/pull/69313/commits/96c260f87546372603634c904e349646d5d56738 on August 24.
Also, there was a bug in the prepare
logic that was fixed in https://github.com/cockroachdb/cockroach/pull/69691/commits/60dd572e552137607a06465b5dd885e5184f1943 on September 1.
I'm not yet speculating about what the root cause is, just recounting what things have been changing in workload and when. So we didn't see any additional improvement from 96c260f87546372603634c904e349646d5d56738 or 60dd572e552137607a06465b5dd885e5184f1943 ?
I'm not yet speculating about what the root cause is, just recounting what things have been changing in workload and when. So we didn't see any additional improvement from 96c260f or 60dd572 ?
I don't think so, but the data is pretty noisy compared to some of the other kv tests.
We have marked this issue as stale because it has been inactive for 18 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to CockroachDB!
Closing since this is stale and the throughput of this test has now recovered to 145k req/s
Describe the problem
The performance of
kv95/enc=false/nodes=3/cpu=96
regressed around 27 July from ~120k request to ~110k requests.https://roachperf.crdb.dev/?filter=cpu%3D96&view=kv95%2Fenc%3Dfalse%2Fnodes%3D3%2Fcpu%3D96&tab=gce
While this aligns with the introduction of the roachprod environment variable bug, it did not recover after that bug was fixed.
Testing with a binary built from
9baaa282b3
seems to reveal that we can see a similar performance drop just by varying the workload, roachtest, and roachprod binaries.For example, using all binaries from
9baaa282b3
But using cockroach from
9baaa282b3
but workload, roachtest, and roachprod from master:Jira issue: CRDB-10044