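Dear author,
Hello!
I recently ran into a problem while trying to run this project from your GitHub repository. I really appreciate your work, and I followed all the guidelines and steps in the project documentation, but the training script fails when I try to run it.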
Problem description:
When I run train_cls.py, I get the following error:
max_iters: 0
Traceback (most recent call last):
File "/home/tangfeilong/Desktop/ijcai/test/zhengzhou/TPRO-main/train_cls.py", line 374, in
train(cfg=cfg)
File "/home/tangfeilong/Desktop/ijcai/test/zhengzhou/TPRO-main/train_cls.py", line 217, in train
train_sampler.set_epoch(np.random.randint(cfg.train.max_iters))
File "mtrand.pyx", line 765, in numpy.random.mtrand.RandomState.randint
File "_bounded_integers.pyx", line 1247, in numpy.random._bounded_integers._rand_int64
ValueError: high <= 0
max_iters: 0
Traceback (most recent call last):
File "/home/tangfeilong/Desktop/ijcai/test/zhengzhou/TPRO-main/train_cls.py", line 374, in
train(cfg=cfg)
File "/home/tangfeilong/Desktop/ijcai/test/zhengzhou/TPRO-main/train_cls.py", line 217, in train
train_sampler.set_epoch(np.random.randint(cfg.train.max_iters))
File "mtrand.pyx", line 765, in numpy.random.mtrand.RandomState.randint
File "_bounded_integers.pyx", line 1247, in numpy.random._bounded_integers._rand_int64
ValueError: high <= 0
Killing subprocess 33595
Killing subprocess 33596
Traceback (most recent call last):
File "/home/ps/anaconda3/lib/python3.9/runpy.py", line 197, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/home/ps/anaconda3/lib/python3.9/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/home/tangfeilong/Desktop/ijcai/test/.local/lib/python3.9/site-packages/torch/distributed/launch.py", line 340, in
main()
File "/home/tangfeilong/Desktop/ijcai/test/.local/lib/python3.9/site-packages/torch/distributed/launch.py", line 326, in main
sigkill_handler(signal.SIGTERM, None) # not coming back
File "/home/tangfeilong/Desktop/ijcai/test/.local/lib/python3.9/site-packages/torch/distributed/launch.py", line 301, in sigkill_handler
raise subprocess.CalledProcessError(returncode=last_return_code, cmd=cmd)
subprocess.CalledProcessError: Command '['/home/ps/anaconda3/bin/python', '-u', 'train_cls.py', '--local_rank=1', '--config', './work_dirs/luad/classification/config.yaml']' returned non-zero exit status 1.
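As far as I can tell, the ValueError itself comes from calling np.random.randint with an upper bound of 0, which matches the max_iters: 0 printed just before each traceback. The minimal sketch below reproduces that outside the project. The helper name pick_epoch_seed and its guard are my own illustration and are not code from the repository. I have not confirmed why max_iters ends up as 0 in my setup.

```python
import numpy as np


def pick_epoch_seed(max_iters: int) -> int:
    """Hypothetical helper (not from the repository): draw a value in [0, max_iters).

    Fails early with a readable message instead of NumPy's terse error.
    """
    if max_iters <= 0:
        raise ValueError(
            f"max_iters must be positive, got {max_iters}; "
            "check how it is computed from the config and dataset"
        )
    return int(np.random.randint(max_iters))


# Reproduce the failure in isolation: an upper bound of 0 leaves no valid
# integer to draw, so NumPy raises ValueError.
reproduced = False
try:
    np.random.randint(0)
except ValueError:
    reproduced = True

print("randint(0) raised ValueError:", reproduced)
print("seed drawn with max_iters=5:", pick_epoch_seed(5))
```

If this reading is right, the real question is why cfg.train.max_iters is 0 with my config and data, rather than anything in the sampler itself.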
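I followed all the setup steps and the expected file structure, but something still seems to be wrong. Could I have missed a key configuration step, or misunderstood part of the code? Any suggestions on how to resolve this would be very helpful. Thank you for the effort you have put into developing this project; I am really looking forward to running and using it.
I look forward to your reply.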