AntelopeIO / spring

C++ implementation of the Antelope protocol with Savanna consensus
Other
5 stars 2 forks source link

Test Failure: disaster_recovery_2 #390

Closed linh2931 closed 1 month ago

linh2931 commented 2 months ago

https://github.com/AntelopeIO/spring/actions/runs/10048012787/job/27771514855?pr=382#step:4:1595

Traceback (most recent call last):
  File "/__w/spring/spring/build/tests/disaster_recovery_2.py", line 140, in <module>
    assert node.waitForLibToAdvance(), "Node did not advance LIB after relaunch"
heifner commented 1 month ago

The test restarts 5 nodes from a snapshot. The test failed because node0 could not connect to any node when launched so it waited 30 seconds before trying again. During this 30 seconds it happened to be producing blocks. Once it was able to connect it was busy feeding blocks to the other node.

Options:

spoonincode commented 1 month ago

appears to still exist? https://github.com/AntelopeIO/spring/actions/runs/10272637950/job/28425737014 reopening for now but if felt strongly can open new one instead

spoonincode commented 1 month ago

noticed again on https://github.com/AntelopeIO/spring/actions/runs/10376337986/job/28728623008 though on an UBSAN build which of course goes kinda on the slow side. Should we reopen this one or do a new issue?