Open cockroach-teamcity opened 4 weeks ago
Reviewing the logs, this failure hit neither the core changefeed error logs in #127530 nor the timeout log in #127553. It seems like the test server just shut down randomly with a "server shutting down: instructing cmux to stop accepting" message. Spot-checking a few similar past failures, they were all running with the secondary tenant:
Asked for help from #multi-tenant here: https://cockroachlabs.slack.com/archives/C02HWA24541/p1723839743273609
Looking at the logs from just this failure, it looks to me like the schema change stopped the feed despite our expectation that it wouldn't.
I240815 09:30:27.016476 14927345 ccl/changefeedccl/kvfeed/kv_feed.go:155 ⋮ [T10,Vcluster-10,nsql1,client=127.0.0.1:60082,hostssl,user=‹sinklessfeeduser›] 404 stopping kv feed due to schema change at 1723714222.413838743,1
Thanks for taking a look.
My interpretation (which might be wrong) was that the changefeed was going to restart. I guess we can't really tell from the error message, though, since the same error is returned for both restart and exit: https://github.com/cockroachdb/cockroach/blob/2f8519c1ae5020614ee1616c829e1d5b3702f942/pkg/ccl/changefeedccl/kvfeed/kv_feed.go#L405-L407 As an aside, we have an issue to improve observability for this: https://github.com/cockroachdb/cockroach/issues/124635
Another piece of evidence that the failure might not be caused by the changefeed stopping is that I don't see the logs that were added in this PR: https://github.com/cockroachdb/cockroach/pull/127530
ccl/changefeedccl.TestNoStopAfterNonTargetAddColumnWithBackfill failed on master @ 575cdd4696dfcac8f311d1ea546683271102f73e:
Parameters:
attempt=1
deadlock=true
run=1
shard=6
Help
See also: How To Investigate a Go Test Failure (internal)
/cc @cockroachdb/cdc
Jira issue: CRDB-41352