pytorch / elastic

PyTorch elastic training
BSD 3-Clause "New" or "Revised" License

Various improvements to `torch.distributed.launch` and `torch.distributed.run` (#60925) #155

Closed: aivanou closed this 3 years ago

aivanou commented 3 years ago

Summary: Pull Request resolved: https://github.com/pytorch/pytorch/pull/60925

Pull Request resolved: https://github.com/pytorch/pytorch/pull/60808

Issues resolved:

https://github.com/pytorch/pytorch/issues/60716
https://github.com/pytorch/pytorch/issues/60754

Differential Revision: D29413019

facebook-github-bot commented 3 years ago

This pull request was exported from Phabricator. Differential Revision: D29413019

aivanou commented 3 years ago

https://github.com/pytorch/pytorch/pull/61294