Open lostlose opened 2 years ago
may be the reason is the batch_size is too small When I train yolov2, I let batch_size be 4 or 8, the loss will be nan When I up it to 32, that problem will not happen Hope this useful
@lostlose Please try my another YOLO project: https://github.com/yjh0410/PyTorch_YOLO-Family
@1023280072 Thank you! Due to memory constraints, I can only set the batch_size to 16, and now I'm trying to adjust the learning rate to get the correct results.
@yjh0410 OK, thanks, I will try it later!
It is nan even at batch size 48: I was using RN50 as backbone, and trying to train coco dataset.
Hi, I met the same problem in https://github.com/yjh0410/PyTorch_YOLOv3/issues/1 when I train yolov3 in this project.