fudan-zvg / SETR

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
MIT License
1.05k stars 150 forks source link

error for using dist_train.sh #44

Closed Lsz-20 closed 2 years ago

Lsz-20 commented 3 years ago

Excuse me I'm Trainning with multiple GPUs,for example:./tools/dist_train.sh ${CONFIG_FILE} ${GPU_NUM} [optional arguments] and I have 2 GPUs try to use Traceback (most recent call last): File "/home/anaconda3/envs/py37_torch1.6/lib/python3.7/runpy.py", line 193, in _run_module_as_main "main", mod_spec) File "/home/anaconda3/envs/py37_torch1.6/lib/python3.7/runpy.py", line 85, in _run_code exec(code, run_globals) File "/home/anaconda3/envs/py37_torch1.6/lib/python3.7/site-packages/torch/distributed/launch.py", line 261, in main() File "/home/anaconda3/envs/py37_torch1.6/lib/python3.7/site-packages/torch/distributed/launch.py", line 257, in main cmd=cmd) subprocess.CalledProcessError: Command '['/home/anaconda3/envs/py37_torch1.6/bin/python', '-u', './tools/train.py', '--local_rank=1', 'configs/SETR/SETR_Naive_768x768_40k_cityscapes_bs_8.py', '--launcher', 'pytorch', '--load-from=./pth/jx_vit_large_p16_384-b3be5167.pth']' returned non-zero exit status 1.

Thanks for your answer!

sixiaozheng commented 2 years ago

Maybe your environment is installed incorrectly, please refer to the A from-scratch setup script in the README.md to reinstall the environment. And run the command ./tools/dist_train.sh ${CONFIG_FILE} ${GPU_NUM}.

Lsz-20 commented 2 years ago

Maybe your environment is installed incorrectly, please refer to the A from-scratch setup script in the README.md to reinstall the environment. And run the command ./tools/dist_train.sh ${CONFIG_FILE} ${GPU_NUM}.

Thanks for your reply.I'll try again