Open skerit opened 1 year ago
Try with the following command:
python -m torch.distributed.launch --nproc_per_node=4 --master_port=1234 main_train_vrt.py --opt ./options/rvrt/... --dist True
Note: --nproc_per_node = number of available GPUs
Try use the torchrun
instead the python -m
https://github.com/open-mmlab/mmdetection/issues/10024#issuecomment-1484982317 this might be helpful
I'm trying to train RVRT, but it just fails like this: