Closed AymenFJA closed 1 year ago
This is completely outside of the control of the mpi-operator.
You need to add a job queueing system (like https://kueue.sigs.k8s.io/docs/tasks/run_mpi_jobs/) or a gang scheduler.
NOTE: The kueue doesn't guarantee that all pods are scheduled to Node at the same time (gang scheduling). So I would suggest using Job queueing by kueue with sequential admission and gang-scheduling by scheduler-plugins.
On my company's production, they work fine :)
@alculquicondor @tenzen-y . Thank you so much, I really appreciate it, and things are way clearer now.
@tenzen-y , sorry, I should have asked before closing this issue. Can you share some initial steps on the approach that you mentioned, please? I am struggling to find a tutorial of steps helping to reproduce the mentioned setup. I really appreciate it.
@AymenFJA You can refer to the following documents:
Thanks, @tenzen-y, for sharing.
Dear all,
Is it possible to start multiple
Launcher
viareplicas
option as a way to run multiplempirun
invocations at once (within a single deployment)?Submitting multiple deployments leads to some of the Pods hanging on pending. For example,
MPIJob-1
andMPIJob-2
(1 launcher and 2 workers for both jobs) are submitted at the same time. This can cause for exampleMPIJob-1-launcher
to be running and the workerpending
forever or vise versa as thelauncher
fromMPIJob1
started but one of the workers, for example, does not have enough resources to start and so on. Here is an actual example:Am I missing something, or is my understanding of
Kubeflow-mpi-operator
is wrong, and is it not possible to do that? Also, is there an alternative way to have multiple MPIJobs to coexist at the same time in a coordinated manner?