Closed hawkw closed 1 month ago
Related: #4872
@hawkw - Is this considered done? Or we're using this issue to track the future work of making boot_on_fault
configurable by user? (there may already be a ticket for that but I haven't located that yet)
This is done --- can't believe I opened this issue and forgot to close it. Whoops!
Depends on #6455 (and probably also #6490).
Per RFD 486:
We should implement that.
Potentially, we could attempt to schedule a new start saga for an instance as part of the update saga that transitions it to
Failed
. However, regardless of whether or not we do that, there should definitely be a RPW that's responsible for periodically listing instances which are in theFailed
state and haveboot_on_fault
disciplines indicating that they should be restarted, and ensure that a start saga is started for those instances. Update sagas which have transitioned an instance toFailed
could just activate that background task.