NVIDIA / nccl

Optimized primitives for collective multi-GPU communication
Other
3.28k stars 831 forks source link

Compute time in the reduction operation #1314

Open tks2004 opened 5 months ago

tks2004 commented 5 months ago

Is there any tool or an option which provides the compute time of the reduction operation