Closed lhy1024 closed 1 month ago
One participant seems to keep resetting the primary because of the priority check, while the other participant keeps skipping its campaign because of the expected primary.
There are two TSOs, 36295 and 40455.
Furthermore, the root cause is that the TSO priority check uses `ResetLeader` instead of `moveLeader`.
The tso priority checker process is:
The secondary can be elected as the new primary only because of a time gap, which is not stable!
For example, if the secondary suffers IO jitter and is not elected as the new primary, the old primary is elected again, and the priority check logic then runs through the same loop once more.
`transfer primary` is effectively the same as `move etcd leader`. We could replace `ResetLeader` with `transfer primary`, which is more time efficient and can avoid the time lag.
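To illustrate why the reset-style flow can loop while a directed transfer does not, here is a minimal Go sketch. It is a toy model and not PD code: `resetsUntilPreferred`, `transferTo`, the node names `tso-a` and `tso-b`, and the delay values are all hypothetical, and "smallest campaign delay wins" is a simplifying assumption about how the election race behaves.

```go
package main

import "fmt"

// resetsUntilPreferred models the reset-style flow (hypothetical helper).
// After every reset all participants campaign again, and the one with the
// smallest campaign delay wins. preferredDelays[i] is the preferred node's
// delay in round i; otherDelay is the old primary's constant delay.
// It returns how many resets were needed, or -1 if the preferred node
// never won within the given rounds.
func resetsUntilPreferred(preferredDelays []int, otherDelay int) int {
	for round, d := range preferredDelays {
		if d < otherDelay {
			return round + 1
		}
		// The old primary won again, so the priority check resets once more.
	}
	return -1
}

// transferTo models a directed hand-off (hypothetical helper): the target
// is named explicitly, so the result does not depend on campaign timing.
func transferTo(members []string, target string) (string, bool) {
	for _, m := range members {
		if m == target {
			return m, true
		}
	}
	return "", false
}

func main() {
	// The preferred node jitters for two rounds before it is fast enough.
	jitter := []int{5, 4, 1}
	fmt.Println("resets needed:", resetsUntilPreferred(jitter, 2))

	primary, ok := transferTo([]string{"tso-a", "tso-b"}, "tso-b")
	fmt.Println("after transfer:", primary, ok)
}
```

In this model the reset flow needs one extra round for every round in which the preferred node is slower than the old primary, while the transfer names its target once and finishes.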
Flaky Test
Which jobs are failing
CI link
https://github.com/tikv/pd/actions/runs/10399520880/job/28798931505
Reason for failure (if possible)
Anything else