Closed tenzen-y closed 1 year ago
/assign @alculquicondor
Rebased.
/lgtm /approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: alculquicondor
The full list of commands accepted by this bot can be found here.
The pull request process is described here
I fixed the logic to calculate minResources so that
calculatePGMinResource
treats the launcher as a replica of higher priority when we don't set priorityClasses.I faced the issue at https://github.com/kubeflow/mpi-operator/pull/540#issuecomment-1496012813.
Background: In the current implementation, if the launcher and workers have the same priority,
calculatePGMinResource
randomly selects prioritized replicas. This means the launcher might be treated as a lower priority than the worker replica when we don't set priorityClass in both replicas.