Closed Vilour closed 4 months ago
Hi,
I tried to run with python -m torch.distributed.run --standalone --nnodes=1 --nproc-per-node=4 train.py --bsz 4 -s But return the following errors:
python -m torch.distributed.run --standalone --nnodes=1 --nproc-per-node=4 train.py --bsz 4 -s
[W socket.cpp:601] [c10d] The IPv6 network addresses of (user, 53821) cannot be retrieved (gai error: -3 - Temporary failure in name resolution).
What's that meaning?
This is an error of torchrun. solved by add 127.0.0.1 user to /etc/hosts.
Hi,
I tried to run with
python -m torch.distributed.run --standalone --nnodes=1 --nproc-per-node=4 train.py --bsz 4 -s
But return the following errors:[W socket.cpp:601] [c10d] The IPv6 network addresses of (user, 53821) cannot be retrieved (gai error: -3 - Temporary failure in name resolution).
What's that meaning?