openucx / ucc

Unified Collective Communication Library
https://openucx.github.io/ucc/
BSD 3-Clause "New" or "Revised" License
195 stars 96 forks source link

UCC plugin for NCCL - adjustments #1021

Open yaeliyac opened 1 week ago

yaeliyac commented 1 week ago

What

Add loopback option for allgather

Why ?

When using UCC plugin for NCCL, using cuda_memcpy might cause deadlock

How ?

Add UCC_TL_UCP_ALLGATHER_USE_CUDA flag to control this

swx-jenkins3 commented 1 week ago

Can one of the admins verify this patch?