er-muyue / DeFRCN

MIT License

AssertionError: Checkpoint checkpoints/voc/defrcn/defrcn_det_r101_base3/model_reset_surgery.pth not found #25

zhengfang1997 closed this issue 2 years ago

zhengfang1997 commented 2 years ago

I ran bash run_voc.sh and got this error:

File "/.../conda_envs/tpami/lib/python3.7/site-packages/fvcore/common/checkpoint.py", line 153, in load
    assert os.path.isfile(path), "Checkpoint {} not found!".format(path)
AssertionError: Checkpoint checkpoints/voc/defrcn/defrcn_det_r101_base3/model_reset_surgery.pth not found!

How can I solve it? Thanks.

Traceback (most recent call last):
  File "/.../conda_envs/tpami/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap
    fn(i, *args)
  File "/.../conda_envs/tpami/lib/python3.7/site-packages/detectron2/engine/launch.py", line 126, in _distributed_worker
    main_func(*args)
  File "/.../fsod/DeFRCN-main/main.py", line 61, in main
    trainer.resume_or_load(resume=args.resume)
  File "/.../fsod/DeFRCN-main/defrcn/engine/defaults.py", line 290, in resume_or_load
    self.cfg.MODEL.WEIGHTS, resume=resume
  File "/.../conda_envs/tpami/lib/python3.7/site-packages/fvcore/common/checkpoint.py", line 227, in resume_or_load
    return self.load(path, checkpointables=[])
  File "/.../conda_envs/tpami/lib/python3.7/site-packages/fvcore/common/checkpoint.py", line 153, in load
    assert os.path.isfile(path), "Checkpoint {} not found!".format(path)
AssertionError: Checkpoint checkpoints/voc/defrcn/defrcn_det_r101_base3/model_reset_remove.pth not found!

zhengfang1997 commented 2 years ago

bash run_voc.sh defrcn 3

zhengfang1997 commented 2 years ago

${SAVE_DIR}/defrcn_det_r101_base${SPLIT_ID}/model_final.pth is not found either.

alphacyp commented 2 years ago

Maybe it's because you didn't put the trained weight file in the correct path.

er-muyue commented 2 years ago

Hi @zhengfang1997, model_reset_remove.pth is generated in this line. Please check whether the base training process ran normally, and make sure the base model has been properly trained.
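One way to check this before launching fine-tuning is to list which of the checkpoint files named in this thread are actually present. The following is only a minimal sketch, not part of DeFRCN: the helper missing_checkpoints is hypothetical, the file names are copied from the error messages above, and the directory is the relative path from the traceback, so it has to be run from the same working directory as run_voc.sh.

```python
import os

# File names copied from the error messages in this thread. That the two
# "reset" files are produced only after base training has written
# model_final.pth is an assumption based on the maintainer's reply.
EXPECTED_FILES = (
    "model_final.pth",
    "model_reset_remove.pth",
    "model_reset_surgery.pth",
)


def missing_checkpoints(base_dir, names=EXPECTED_FILES):
    """Return the expected checkpoint paths that do not exist under base_dir."""
    paths = [os.path.join(base_dir, name) for name in names]
    return [p for p in paths if not os.path.isfile(p)]


if __name__ == "__main__":
    # Relative path taken from the traceback; os.path.isfile resolves it
    # against the current working directory.
    base_dir = "checkpoints/voc/defrcn/defrcn_det_r101_base3"
    print("Looking in:", os.path.abspath(base_dir))
    for path in missing_checkpoints(base_dir):
        print("missing:", path)
```

If model_final.pth itself is reported missing, the base training step most likely did not finish, and the later files cannot exist yet. If it is present but the absolute path printed is not the one you expect, the script is probably being run from a different directory.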