Open rsemihkoca opened 1 month ago
👋 Hello @rsemihkoca, thank you for your detailed question about the YOLOv10 raw outputs 🚀!
This is an automated response to let you know that your query is being processed. An Ultralytics engineer will assist you soon to address your specific question about the one2one
calculations and the use of convolutional networks in the model.
In the meantime, I recommend checking out our Documentation for more insights into model architectures and outputs.
If this is a 🐛 Bug Report, make sure to provide a minimum reproducible example that can help us investigate further.
For real-time interactions and questions, join us on Discord 🎧. You might also find our Discourse and Subreddit helpful for community support.
Please ensure you're using the most recent version of the ultralytics
package with all dependencies, by running:
pip install -U ultralytics
You can also run YOLOv8 in a variety of environments:
Stay tuned for assistance from our team! 😊
You can read the YOLOv10 paper for the details on the architecture.
Search before asking
Question
When I look at the raw results from yolov10, I see that a value called one2one is calculated. What is this? The shape of the raw outputs is 300 to 6. There are 300 predictions, but most of them are in the same place. Does one2one reduce these? Why do we calculate cv3 ? When I examine the code, a convulsion neural network is used. I don't understand why this is done? I am generally curious about the one2one part. If the model is nms-free, what does this field do? I would expect the model to come as if nms was already applied in the raw outputs.
Additional