Open liuxingbo12138 opened 8 months ago
I have two h100 servers, each connected to 8 400G Nics. How can I run nccl-test to run all the 8 400G Nics, and require GDR traffic to avoid the memory bandwidth becoming the bottleneck,nccl can do it?
I have two h100 servers, each connected to 8 400G Nics. How can I run nccl-test to run all the 8 400G Nics, and require GDR traffic to avoid the memory bandwidth becoming the bottleneck,nccl can do it?