Error encountered during training!

sunmooncode commented 2 years ago

Traceback (most recent call last):
  File "train.py", line 555, in <module>
    train(hyp, opt, device, tb_writer)
  File "train.py", line 377, in train
    is_coco=is_coco)
  File "/home/Face/YOLO-FaceV2/test.py", line 115, in test
    loss += compute_loss([x.float() for x in train_out], targets)[1][:5]  # box, obj, cls
  File "/home/Face/YOLO-FaceV2/utils/loss.py", line 224, in __call__
    dic[int(value)].append(indexs)
KeyError: 32

The error also shows up when training on two GPUs; running on a single GPU produces the error above!

sunmooncode commented 2 years ago

When I set batch-size to 16, training runs normally.
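
For anyone hitting the same KeyError, here is a minimal sketch of one possible cause. It assumes, without having confirmed it in utils/loss.py, that `dic` is pre-populated with a fixed set of integer keys and that a larger batch yields an index outside that set. The function name `group_indices_by_image` and the fixed range of 32 are hypothetical and only illustrate the failure and a `defaultdict`-based way to avoid it.

```python
from collections import defaultdict


def group_indices_by_image(image_ids):
    """Group positions by image id (hypothetical stand-in for the dic[...] logic)."""
    groups = defaultdict(list)  # missing keys are created on first access
    for position, value in enumerate(image_ids):
        groups[int(value)].append(position)
    return dict(groups)


# Assumed failure mode: a dict with a fixed key range raises once an id exceeds it.
fixed = {i: [] for i in range(32)}  # keys 0..31 only (assumption)
try:
    fixed[32].append(0)
except KeyError as err:
    print("KeyError:", err)

# The defaultdict version accepts any id, whatever the batch size.
print(group_indices_by_image([0, 0, 31, 32]))
```

If the real code does build `dic` from a fixed range, that would be consistent with a smaller batch-size working, but this is a guess rather than a diagnosis.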

sunmooncode commented 2 years ago

During training, lrep becomes nan. Is this a PyTorch version issue?
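
To narrow down where the nan first appears, a small check along these lines might help. It is only a sketch: the helper name `first_non_finite` and the component names in the example dict are hypothetical, and it assumes each loss component can be converted with `float()` (which works for Python numbers and for scalar tensors).

```python
import math


def first_non_finite(loss_items):
    """Return the name of the first loss component that is nan or inf, else None."""
    for name, value in loss_items.items():
        if not math.isfinite(float(value)):
            return name
    return None


# Example with made-up values: lrep has gone to nan, the others are fine.
bad = first_non_finite({"box": 0.12, "obj": 0.05, "lrep": float("nan")})
if bad is not None:
    print(f"non-finite loss component: {bad}")
```

Calling this once per iteration and stopping at the first hit shows which batch triggers the problem, which is usually more informative than comparing PyTorch versions.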