Closed Yejing-Lai closed 1 week ago
Hi @mrwyattii. Please kindly review~ Thanks!
@mrwyattii FYI, this is to fix for a PyTorch API change that will affect DeepSpeed running with PyTorch nightly. Thanks!
Hi @mrwyattii. The failing test seems like an HTTP error. Could you please rerun the CI? Thanks!
@mrwyattii FYI, this is to fix for a PyTorch API change that will affect DeepSpeed running with PyTorch nightly. Thanks!
@delock - thanks, do you know what version this was added in, so we can know what the minimum pytorch version supported by this new code is?
@mrwyattii FYI, this is to fix for a PyTorch API change that will affect DeepSpeed running with PyTorch nightly. Thanks!
@delock - thanks, do you know what version this was added in, so we can know what the minimum pytorch version supported by this new code is?
AFAICT, _get_socket_with_port
only got removed from torch.distributed.elastic.agent.server.api
recently, but get_free_port
and get_socket_with_port
have existed in torch.distributed.elastic.utils.distributed
for a while -- at least going way back up to 3 years ago. So we shouldn't need to pin a minimum PyTorch version for this.
@mrwyattii FYI, this is to fix for a PyTorch API change that will affect DeepSpeed running with PyTorch nightly. Thanks!
@delock - thanks, do you know what version this was added in, so we can know what the minimum pytorch version supported by this new code is?
AFAICT,
_get_socket_with_port
only got removed fromtorch.distributed.elastic.agent.server.api
recently, butget_free_port
andget_socket_with_port
have existed intorch.distributed.elastic.utils.distributed
for a while -- at least going way back up to 3 years ago. So we shouldn't need to pin a minimum PyTorch version for this.
Sounds good, do you want to approve and we can get this merged @adk9 ?
The latest PyTorch deleted the '_get_socket_with_port' API, replacing it with 'get_free_port'.
Fixes: #5603