Closed vwxyzjn closed 2 years ago
This PR Prototype torch.distributed integration for multi GPU
torch.distributed
cc @markelsanz14, I prototyped the torch.distributed integration but it's only 6% faster. I still feel I am missing the bottleneck somewhere because the prototype with CleanRL was like 25% faster
This PR Prototype
torch.distributed
integration for multi GPU