Open as51340 opened 2 weeks ago
You can see that at 15:17:58,885 start-partition-node was generated, but I don't see the corresponding stop-partition-node until the final generator started.
Is that... not what you expected? The final generator's job is specifically to emit those operations in case a partition is still ongoing.
I was expecting stop-partition-node to be generated rand([10,60]) seconds later, because of:
(defn nemesis-events
  "Create a random sequence of nemesis events. Disruptions last [10-60] seconds, and the system remains undisrupted for some time afterwards."
  [nodes-config]
  (let [events [[{:type :info :f :start-partition-halves}
                 (gen/sleep (+ 10 (rand-int 51))) ; [10, 60]
                 {:type :info :f :stop-partition-halves}
                 (gen/sleep 5)]
                [{:type :info :f :start-partition-ring}
                 (gen/sleep (+ 10 (rand-int 51))) ; [10, 60]
                 {:type :info :f :stop-partition-ring}
                 (gen/sleep 5)]
                [{:type :info :f :start-partition-node}
                 (gen/sleep (+ 10 (rand-int 51))) ; [10, 60]
                 {:type :info :f :stop-partition-node}
                 (gen/sleep 5)]
                [{:type :info :f :kill-node}
                 (gen/sleep (+ 10 (rand-int 51))) ; [10, 60]
                 {:type :info :f :heal-node}
                 (gen/sleep 5)]
                [{:type :info :f :start-network-disruption :value [(keys nodes-config) net/all-packet-behaviors]}
                 (gen/sleep (+ 10 (rand-int 51))) ; [10, 60]
                 {:type :info :f :stop-network-disruption}
                 (gen/sleep 5)]]]
because I do start-partition-node, then sleep, then stop-partition-node.
However, there is no stop-partition-node from 15:17:58 to the end.
I'm afraid debugging this is a bit beyond the scope of what I can offer as far as unpaid support goes. There's a lot of code here, and I don't know what much of it is calling into, or what's calling it. This might involve interactions with whatever top-level generator you're wrapping the nemesis generator in. Or it might be because the nemesis itself seems to go and do... maybe blocking IO? Really can't say, sorry!
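To make the "interaction with the top-level generator" possibility concrete, here is a minimal sketch in plain Clojure. It does not use Jepsen and is not a diagnosis of this issue: it only models the assumption that the wrapping generator enforces an overall time limit, so a cycle still sleeping when the limit is reached never gets to emit its stop operation. The names emitted-before-limit and partition-node-cycle, the {:sleep n} step shape, and the 40-second sleep are all hypothetical.

```clojure
;; Toy model only: simulated time, no Jepsen, all names hypothetical.
(defn emitted-before-limit
  "Walks a sequence of steps, where a step is either {:sleep seconds} or an
  operation map, and returns the operations emitted before `limit` seconds
  of simulated time have elapsed."
  [steps limit]
  (loop [t 0, steps steps, out []]
    (if-let [[step & more] (seq steps)]
      (cond
        (>= t limit)  out                                  ; time budget used up
        (:sleep step) (recur (+ t (:sleep step)) more out) ; advance the clock
        :else         (recur t more (conj out step)))      ; emit the operation
      out)))

(def partition-node-cycle
  [{:type :info :f :start-partition-node}
   {:sleep 40}                              ; one possible draw from [10, 60]
   {:type :info :f :stop-partition-node}
   {:sleep 5}])

;; Only 30 seconds left: the sleep outlasts the limit, so no stop is emitted.
(mapv :f (emitted-before-limit partition-node-cycle 30))
;; => [:start-partition-node]

;; 60 seconds left: the whole cycle fits.
(mapv :f (emitted-before-limit partition-node-cycle 60))
;; => [:start-partition-node :stop-partition-node]
```

Under that assumption, a start-partition-node emitted less than one sleep interval before the time limit would be left for the final generator to clean up, which would look like the log described here. Whether that is what actually happened depends on the wrapping generator, which is not shown in this thread.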
Understood, thanks!
Hi,
Sometimes, at the end of the test, the operation that should stop the disruption isn't generated. Example from logs:
You can see that at 15:17:58,885 start-partition-node was generated, but I don't see the corresponding stop-partition-node until the final generator started.
This is the code I have for nemesis: