iscyy / yoloair

🔥🔥🔥 专注于YOLOv5,YOLOv7、YOLOv8、YOLOv9改进模型,Support to improve backbone, neck, head, loss, IoU, NMS and other modules🚀
https://github.com/iscyy/yoloair
GNU General Public License v3.0
2.47k stars 428 forks source link

after training completion of all epochs , its crashing, not able to get final map,best.pt weight #154

Open akashAD98 opened 1 year ago

akashAD98 commented 1 year ago

i can able to do complete all training, once it reaches to last epoch its throwing an error & not able to get map,

also can we do testing of the trained model using detect.py?

the weight file size is 259 MB , I guess weight files are not downloading properly or they are corrupt. i tested it with detect.py & getting

RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory
  Epoch   gpu_mem       box       obj       cls    labels  img_size
   119/119     1.63G   0.02918   0.05866  0.005124         3       640: 100%|██████████| 2311/2311 [09:13<00:00,  4.18it/s]                                                                                
               Class     Images     Labels          P          R     mAP@.5 mAP@.5:.95: 100%|██████████| 548/548 [01:08<00:00,  7.96it/s]                                                                  
                 all       2192       8590      0.982      0.939      0.969      0.758
Traceback (most recent call last):
  File "train.py", line 695, in <module>
    main(opt)
  File "train.py", line 591, in main
    train(opt.hyp, opt, device, callbacks)
  File "train.py", line 449, in train
    torch.save(ckpt, best)
  File "/home/sciffer/.local/lib/python3.8/site-packages/torch/serialization.py", line 424, in save
    return
  File "/home/sciffer/.local/lib/python3.8/site-packages/torch/serialization.py", line 290, in __exit__
    self.file_like.write_end_of_file()
RuntimeError: [enforce fail at inline_container.cc:445] . PytorchStreamWriter failed writing central directory for archive best: file write failed

@iscyy @WangQvQ please let me know how can I solve this issue