microsoft / msccl

Microsoft Collective Communication Library
Other
314 stars 31 forks source link

Fix syncthreads #39

Closed saeedmaleki closed 2 years ago

saeedmaleki commented 2 years ago

when counts are different, we may have a situation where a reduction followed by a send may not be in sync, especially for LL protocol. This commit makes sure that there is a syncthreads between instructions within the same threadblock regardless of hasdep filed.