Open rolurq opened 3 months ago
Hi there,
Thank you for reaching out and providing detailed information about the issue you're facing.
It looks like you're encountering an UnpicklingError
when trying to load your model after training. This error typically indicates that the file you're trying to load is corrupted or not in the expected format.
Here are a few steps you can take to troubleshoot and resolve this issue:
Verify Model File Integrity: Ensure that the model file (epoch-291.pt
) is not corrupted. You can try downloading the file again from the Ultralytics HUB to see if the issue persists.
Update Packages: Make sure you are using the latest versions of the Ultralytics and PyTorch packages. You can update them using the following commands:
pip install --upgrade ultralytics
pip install --upgrade torch
Re-run Training: Sometimes, re-running the training process can help resolve issues with corrupted files. Ensure that you have a stable internet connection during the training process to avoid any interruptions.
Check File Path: Ensure that the file path provided is correct and that the file exists at the specified location.
Use Local File: If the file is available locally, you can try loading it directly from your local system instead of using the URL:
model = YOLO('weights/epoch-291.pt')
If the issue persists after trying these steps, please provide additional details such as any error messages or logs you encounter. This will help us further diagnose the problem.
For more detailed guidance, you can refer to our Ultralytics HUB Quickstart Guide.
Feel free to reach out if you have any more questions or need further assistance. We're here to help! 😊
@rolurq It looks like you have a checkpoint for epoch 291. Can you try resuming training?
@sergiuwaxmann As I mentioned in the post, when I try to resume training it throws an exception, the exception is also in the post.
Search before asking
HUB Component
Models, Training
Bug
I trained my model using Collab and after it finished the model in the hub says 100% but that training hasn't finish. When I try to run training again on Collab to maybe trigger completion once more but when I do so it raises and exception and it can't run.
Environment
Ultralytics HUB Version v0.1.46 Client User Agent Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/127.0.0.0 Safari/537.36 Operating System Win32 Browser Window Size 2352 x 1352 Server Timestamp 1722690165
Minimal Reproducible Example
No response
Additional