Open chaosi-zju opened 1 month ago
All modified and coverable lines are covered by tests :white_check_mark:
Project coverage is 52.97%. Comparing base (
aded7c0
) to head (7fc9c12
).
:exclamation: Your organization needs to install the Codecov GitHub app to enable full functionality.
:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.
This Pr mixes fault self-healing and rescheduling. I think fault self-healing includes rescheduling, similar to when a node crashes, the workload corresponding to the pod on the node will regenerate the pod. This is completed by multiple controllers working together, including a scheduler. If the goal is self-healing, then multiple components need to be considered for coordination. If it is only rescheduling, then only the target of eviction and the conditions for stopping eviction need to be considered. Can we consider the design concept of the Descheduler project in the community
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: To complete the pull request process, please ask for approval from rainbowmango after the PR has been reviewed.
The full list of commands accepted by this bot can be found here.
What type of PR is this?
/kind design /kind documentation
What this PR does / why we need it:
This proposal aims to introduce a mechanism of active triggering rescheduling, which benefits a lot in application failover scenarios. This can be realized by introducing a new API, and a new field would be marked when this new API called, so that scheduler can perceive the need for rescheduling.
Which issue(s) this PR fixes:
Fixes part of #4840
Special notes for your reviewer:
Does this PR introduce a user-facing change?: