neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs
https://neuralmagic.com/deepsparse/
Other
3.04k stars 173 forks source link

What is the format of the boxes returned from a YOLO Pipeline? #1663

Open EricPedley opened 3 months ago

EricPedley commented 3 months ago

It's XYXY top left,bottom right. I guess the negative values are just the model predicting that the object extends outside the image. Leaving this open because it's still a problem that this isn't documented well.

Is it xyxy? xywh? If xywh, is the xy the center or top-left? I ran a yolo pipeline on an image and there are negative numbers in the bounding box ouput. Are the coordinates relative to the image center or something? image

No info to be found in any of these places that would make sense

I would be more than happy to get a "RTFM" response if I could just get a link to where this is documented.

EricPedley commented 3 months ago

Ok I figured it out myself by searching the project for the string cv2.rectangle since I knew this had to be parsed somewhere to do annotation. It's XYXY top left,bottom right. I guess the negative values are just the model predicting that the object extends outside the image image

Gonna leave the issue open because this should be documented somewhere.

ZhianLin commented 3 months ago

Respectfully, how the hell did you respond so fast and how does downloading this help me? I'm like 99% sure you're a bot r

this download link is virus.