microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
https://www.deepspeed.ai/
Apache License 2.0

Update elastic_agent.py. Delete _get_socket_with_port import #5675

Closed. QiaoZhennn closed this 1 week ago

QiaoZhennn commented 1 week ago

Cannot import _get_socket_with_port from torch.distributed.elastic.agent.server.api, so this PR deletes that import from elastic_agent.py.

kit1980 commented 1 week ago

Context: _get_socket_with_port was removed from PyTorch in https://github.com/pytorch/pytorch/pull/125743. It was a private API, so it carried no backward-compatibility guarantee.

adk9 commented 1 week ago

Thanks for the fix. This is a duplicate of #5654.

loadams commented 1 week ago

Closing as a duplicate of #5654, since that PR is set to be merged.