Closed xirc closed 2 years ago
From diagnosing logs of another test that also got the requirement failed error, the following situation could be a cause of this error:
RaftActor
of replica-group-106:15:50.049
: It appended log entry (index 788) into ReplicatedLog
06:15:50.262
: It updated the commit index to 78806:16:13.451
: It voted other RaftActor
in term 706:16:13.681
: It appended log entry (index 789) into ReplicatedLog
06:16:14.140
: It appended log entry (index 790) into ReplicatedLog
06:16:14.140
: It updated the commit index to 789RaftActor
of replica-group-206:06:04.726
: It became the leader in term 206:15:50.050
\~ 06:15:50.051
: It updated the commit index to 788
06:15:50.050
: It updated indices for replica-group-1 (next index = 788 -> 789, match index = 787 -> 788) by applying an event SucceededAppendEntries
06:15:50.051
: It updated indices for replica-group-3 (next index = 788 -> 789, match index = 787 -> 788) by applying an event SucceededAppendEntries
06:15:56.314
: It started a replication of log entry (index 789)06:16:01.388
: It(leader) became a follower by detecting a new term (term 3)06:16:18.102
: It restarted due to a persistence failure06:16:33.837
: It voted the RaftActor
of replica-group-3 on term 1106:16:36.075
: It updated the commit index (0 -> 789)06:16:36.076
: It got the requirement failed errorRaftActor
of replica-group-306:15:50.049
: It appended log entry (index 788) into ReplicatedLog
06:15:50.262
: It updated the commit index to 78806:16:13.448
: It became the leader in term 7
RaftActor
of replica-group-1 voted this RaftActor
(of replica-group-3)06:16:13.630
: It succeeded in replication of NoOp
(index 789) and then updated the commit index to 789
SucceededAppendEntries
06:16:21.930
: It updated the commit index to 83206:16:22.176
: It became a follower by detecting a new term (term 8)06:16:24.727
: It became the leader in term 1106:16:24.727
: It updated indices for all replica groups (next index = None, match index = None)06:16:24.737
\~ 06:16:24.824
:
06:16:32.924
\~ 06:16:33.700
:
06:16:32.924
: It updated the next index for replica-group-2 (None -> 833)06:16:32.924
: It decremented the next index for replica-group-2 (833 -> 832)06:16:33.700
: It decremented the next index for replica-group-2 (736 -> 735)06:16:34.037
:
There might be at least two possible solutions:
There might be a reason behind the next index being lower than expected:
AppendEntries
messages at some heartbeat interval, and the follower replies with multiple AppendEntriesFaield
messages, which decrease the next index than expectedAppendEntries
(on batch mode), and the follower replies with multiple AppendEntriesFaield
, which decreases the next index than expected
The following error occurred in some fault injection tests: