when i use :
CUDA_VISIBLE_DEVICES="3,4" python -m torch.distributed.launch --nproc_per_node 2 train_net.py
i meet some trouble, like this:
RuntimeError: CUDA error: an illegal memory access was encountered
terminate called without an active exception
How can I solve it,thanks a lot!
when i use : CUDA_VISIBLE_DEVICES="3,4" python -m torch.distributed.launch --nproc_per_node 2 train_net.py i meet some trouble, like this: RuntimeError: CUDA error: an illegal memory access was encountered terminate called without an active exception How can I solve it,thanks a lot!