Closed samcmho closed 7 months ago
Add a note regarding setting ulimit -n 1048576 if orchestrators relies on SSH to launch processes to run communication patterns doing send-recvs between many GPU pairs
ulimit -n 1048576
Add a note regarding setting
ulimit -n 1048576
if orchestrators relies on SSH to launch processes to run communication patterns doing send-recvs between many GPU pairs