Closed himanshu-kun closed 10 months ago
The problem arose as the dev-ops couldn't figure out why Unknown
machine not turning Failed
and what are these weird logs.
We need to solve it by enhancing the logs enough to help the dev-ops know that MCM is throttling conversion of Unknown
to Failed
machine , and is following meltdown logic.
And we should also update the Playbook to update DoDs abt meltdown scenario. It can be a link to our docs.
/close
How to categorize this issue?
/area ops-productivity /kind enhancement /priority 2
What would you like to be added: Currently the meltdown scenario doesn't present good enough logs for any external user to figure out that meltdown control is taking place in MCM logs like these are present currently
This needs to be improved. A new status field could be thought of too(needs to be considered as part of #724 ). Also the Playbook needs to be updated to tell scenario when meltdown can happen.
Why is this needed: Clearer more understandable logs