RangiLyu / nanodet

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB (int8) / 1.8 MB (fp16) and runs at 97 FPS on cellphone🔥
Apache License 2.0
5.63k stars 1.03k forks

Training code freezes when saving best checkpoint #539

Open aemior opened 8 months ago

aemior commented 8 months ago

I am training NanoDet on 4x 4090 GPUs with the MobileOne backbone and batch size 50; memory use is about 23.3 GB per GPU. The training code freezes when saving the best checkpoint at this line: https://github.com/RangiLyu/nanodet/blob/4d85d0cb15aa076261a85ab93e7d9e45b7f64fda/nanodet/trainer/task.py#L273 Any ideas on how to debug this?
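
One possible way to narrow down a freeze like this is to have every training process periodically dump its Python stack traces to a file, so the dumps written after the freeze show where each rank is blocked. The sketch below uses only the standard-library `faulthandler` module. The helper name `enable_hang_diagnostics`, the `hang_traces` directory, and the use of the `LOCAL_RANK` environment variable to label files are assumptions for illustration, not part of NanoDet.

```python
import faulthandler
import os
import signal


def enable_hang_diagnostics(timeout_s=600.0, log_dir="hang_traces"):
    """Periodically dump the stack traces of all threads to a per-process file.

    Hypothetical helper: call it once at the start of the training script,
    in every process. Keep a reference to the returned file object so the
    underlying file descriptor stays open.
    """
    os.makedirs(log_dir, exist_ok=True)
    # Assumption: the launcher exports LOCAL_RANK for each worker process.
    # If it does not, the pid in the file name still separates the processes.
    rank = os.environ.get("LOCAL_RANK", "0")
    path = os.path.join(log_dir, f"rank{rank}_pid{os.getpid()}.log")
    trace_file = open(path, "w")

    # Dump all thread stacks every `timeout_s` seconds, whether or not the
    # process is stuck. After a freeze, consecutive dumps look identical.
    faulthandler.dump_traceback_later(timeout_s, repeat=True, file=trace_file)

    # On platforms that have SIGUSR1, also allow an on-demand dump with
    # `kill -USR1 <pid>`.
    if hasattr(signal, "SIGUSR1"):
        faulthandler.register(signal.SIGUSR1, file=trace_file, all_threads=True)

    return trace_file, path
```

Comparing the per-rank files after the freeze may show whether one process is inside the checkpoint save while the others are waiting somewhere else, which would suggest a synchronization mismatch between ranks rather than a disk or memory problem.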