Open mgrubent opened 3 years ago
The issue description is a little off. This is only relevant for a broker_failure
anomaly, under the following conditions:
Ideally, CC broker failure self-healing should act on a broker failure that happened before self-healing was enabled.
Ah; will update the title
Background
cruise-control
provides self-healing for a variety of anomalies.When an anomaly is detected, and the corresponding self-healing
boolean
is enabled,cruise-control
can generate a proposal on its own to correct the anomaly, and then execute that proposal.When these anomalies happen while the corresponding self-healing
boolean
is disabled,cruise-control
(as-expected) does not generate proposals to correct the anomaly.However, when an anomaly (e.g. a broker failure) occurs while self-healing is disabled, that same anomaly event will not cause
cruise-control
to generate a proposal when self-healing is later enabled.Problem
Anomalies that occur while self-healing are disabled can essentially be "lost" to
cruise-control
, and some other actor has to tell cruise-control affirmatively to act. Re-enabling self-healing is insufficient to causecruise-control
to heal the anomaly.Proposal
When
cruise-control
's anomaly-detection is re-enabled, any outstanding (i.e. not stale) anomalies should be acted on.