NVIDIA / nccl

Optimized primitives for collective multi-GPU communication
Other
3.14k stars 791 forks source link

How sendProxyProgress() in net.cc works #1319

Open ZhiyiHu1999 opened 3 months ago

ZhiyiHu1999 commented 3 months ago

Hello! I used some tracing tools to trace all-reduce operation in NCCL and found that the execution of runRing in all_reduce.h in GPU are always related to sendProxyProgress() in net.cc which seems to be related to CPU. I wonder whether you could kindly provide me some hints about the function of sendProxyProgress() and what are the functions of net.cc and proxy.h. Thanks a lot!

shanleo2024 commented 3 months ago

Hi, you can check some information from the following link: https://blog.csdn.net/shanleo1986/article/details/137777787

ZhiyiHu1999 commented 3 months ago

Hi, you can check some information from the following link: https://blog.csdn.net/shanleo1986/article/details/137777787

Thanks a lot!