kubernetes-sigs / kueue

Kubernetes-native Job Queueing
https://kueue.sigs.k8s.io
Apache License 2.0
1.41k stars 256 forks source link

TAS: support rank-ordering for Pods #3533

Open mimowo opened 9 hours ago

mimowo commented 9 hours ago

What would you like to be added:

For Jobs which provide indexing (like batch/Job) we should place Pods with consecutive indexes (ranks) should be placed as close as possible in the topology tree.

The current implementation places pods pretty much randomly (as they show up in the API server).

Example, we have a jobs with 10pods: 0,1,2,3,4,5,6,7,8,9. We have 3 racks, each with 4 slots.

Why is this needed:

For improved performance of network communication between pods. This is especially important for AI/ML frameworks, where the pods exchange data in the ring structure (like in NCCL).

mimowo commented 9 hours ago

/assign

mimowo commented 9 hours ago

cc @PBundyra @mwysokin @mwielgus @tenzen-y