Closed KDLPro closed 1 month ago
👋 Hello @KDLPro, thank you for raising an issue about the Ultralytics HUB 🚀! Please check out our HUB Docs for more detailed information:
It sounds like you might be encountering some issues with the model checkpoint. If this is a 🐛 Bug Report, please provide screenshots and steps to reproduce the problem to help us assist you better. Ensuring a Minimum Reproducible Example (MRE) will be very helpful.
If this is a ❓ Question, share as much relevant information as possible about your dataset, model, and environment.
This is an automated response, and an Ultralytics engineer will also assist you soon. We appreciate your patience and understanding! 😊
@KDLPro I just looked at your model and it looks like you can resume training from epoch 95. Can you try resuming again?
Oh I just resumed the model and it was done! Still it was weird that Roboflow is waiting for connection.
Maybe due to this?
@KDLPro Something is indeed strange with this model. Do you mind training it again (create new model)?
I'll try when I have time. But I do have results for all 100 epochs. Honestly, the app has problems when the system is disconnected and reconnected again.
Here are the results btw @sergiuwaxmann:
It might be a bug or smthn @sergiuwaxmann related to Internet connection
@KDLPro Indeed, we noticed this as well and will try to improve this feature. The resume works perfectly with our Cloud Training or if the environment doesn't change (e.g., local training - resumed from the same environment).
I see, also apparently the model is missing from my library as well.
Also, how do I calculate model accuracy?
@KDLPro What do you mean the model is missing from your library? Model mAP is shown in the model list.
It's missing, however I can access through the link directly
Ahh so mAP is equivalent to model accuracy?
Hello @KDLPro! Yes, mAP (mean Average Precision) is often used as a measure of model accuracy in object detection tasks. It evaluates how well the model predicts bounding boxes and classifies objects.
If you're experiencing issues with your model not appearing in the library, try refreshing the page or clearing your cache. If the problem persists, it might be a temporary glitch. You can still access it directly through the link, which is good.
Feel free to reach out if you have more questions or need further assistance. 😊
Gotcha, but which is more commonly used if you talk about model accuracy only? Since there's mAP50 and mAP50-95. Do I have to calculate model accuracy by taking the average of the two values @pderrenger?
Hello! When discussing model accuracy in object detection, mAP @KDLPro and mAP@50-95 are both important metrics:
Typically, mAP@50-95 is considered a more robust measure of accuracy as it evaluates the model's performance across various levels of overlap. You don't need to average the two; instead, use them to understand different aspects of your model's performance.
If you have further questions, feel free to ask! 😊
Gotcha, thank you very much!
Also, I have one final question. How would I be able to improve the performance of the model given that there is underfitting in the object loss and minor overfitting in the box loss?
Hello! To improve your model's performance, especially with underfitting in object loss and minor overfitting in box loss, consider the following strategies:
Data Augmentation: Enhance your dataset with techniques like flipping, rotation, and scaling to increase diversity and help the model generalize better.
Learning Rate Adjustment: Experiment with different learning rates. A learning rate that's too high can cause underfitting, while a lower rate might help the model converge better.
Regularization: Implement techniques like dropout or L2 regularization to reduce overfitting.
Model Architecture: Try using a more complex model if your current one is too simple, or simplify it if it's too complex for your dataset.
More Data: If possible, increase the size of your training dataset to provide more examples for the model to learn from.
Hyperparameter Tuning: Adjust other hyperparameters such as batch size, optimizer, and epochs to find the optimal configuration.
Feel free to experiment with these suggestions and see which combination works best for your specific case. If you have more questions, just let me know! 😊
Search before asking
Question
Right now, I'm training a model with the help of Ultralytics HUB. Here's the progress so far...
However, I encounter issues when continuing from previous checkpoint as earlier, the notebook was disconnected from the HUB so I tried restarting it:
The checkpoint saved in local storage is in epoch 68 though. How do I solve this?
Additional
No response