Object detection class only has car?

CAIC-AD / YOLOPv2

YOLOPv2: Better, Faster, Stronger for Panoptic driving Perception

MIT License

544 stars, 67 forks

It seems the model might be lumping car, truck, bus, and train into a single category named "vehicles" and training on that one class, similar to what other models in the same family (HybridNets, YOLOP, etc.) do for fair comparison. But the class ID 3 does feel odd.
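To make the guess above concrete, here is a minimal sketch of what such a label-merging step could look like. The names `MERGED_NAME`, `MERGED_SOURCES`, and `merge_label` are hypothetical and not taken from the YOLOPv2 code, and dropping every other category is also an assumption.

```python
from typing import Optional

# Hypothetical merge table: the four source categories mentioned above
# collapse into one class. None of these names come from the repository.
MERGED_NAME = "vehicle"
MERGED_SOURCES = {"car", "truck", "bus", "train"}


def merge_label(category: str) -> Optional[str]:
    """Return the merged class name, or None if the category is not kept."""
    return MERGED_NAME if category in MERGED_SOURCES else None


labels = ["car", "bus", "person", "train", "truck"]
merged = [merge_label(c) for c in labels]
print(merged)  # ['vehicle', 'vehicle', None, 'vehicle', 'vehicle']
```

If training really works like this, a single detection class would be expected, although that alone would not explain why it shows up as ID 3.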

Also, @nikkita-28 and I took a peek at the code and spotted something interesting: the detection head's output tensor holds confidences for a whopping 80 classes (COCO?), and the maximum of those always comes out at index 3!

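if multi_label:
    # keep every (box, class) pair whose score exceeds the threshold
    i, j = (x[:, 5:] > conf_thres).nonzero(as_tuple=False).T
    x = torch.cat((box[i], x[i, j + 5, None], j[:, None].float()), 1)
else:  # best class only
    # conf is the top class score per row, j is its class index
    conf, j = x[:, 5:].max(1, keepdim=True)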
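For anyone who wants to reproduce the observation, here is a rough pure-Python sketch of the "best class only" branch plus a count of which class index wins. The helpers `best_class`, `class_histogram`, and `make_row` are hypothetical, the rows are made-up data rather than real model output, and the 0.25 threshold is just a placeholder.

```python
from collections import Counter


def best_class(row, num_box_fields=5):
    """Mimic the single-label branch: return (confidence, class index) for one row."""
    scores = row[num_box_fields:]
    j = max(range(len(scores)), key=scores.__getitem__)
    return scores[j], j


def class_histogram(rows, conf_thres=0.25):
    """Count how often each class index wins among rows above the threshold."""
    counts = Counter()
    for row in rows:
        conf, j = best_class(row)
        if conf > conf_thres:
            counts[j] += 1
    return counts


def make_row(winner, score):
    """Build a fake prediction row: 4 box values, 1 objectness value, 80 class scores."""
    scores = [0.01] * 80
    scores[winner] = score
    return [0.0, 0.0, 1.0, 1.0, 0.9] + scores


rows = [make_row(3, 0.8), make_row(3, 0.6), make_row(3, 0.1)]
print(class_histogram(rows))  # Counter({3: 2})
```

Running the same kind of count over the real predictions would show whether index 3 is the only class that ever wins, or just the most common one.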

Any chance the devs could shed some light on this? Thanks a bunch!


Object detection class only has car? #53