ROCm / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration
http://pytorch.org
Other
219 stars 51 forks source link

nccl dump wait delay #1540

Closed ramcherukuri closed 1 month ago

ramcherukuri commented 1 month ago

Fixes #ISSUE_NUMBER SWDEV-475455 and SWDEV-473434.

NCCl Scan was initiated a bit early.