impiga / Plain-DETR

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
MIT License

Question about flatten in HungarianMatcher #18

Closed wsh9352 closed 10 months ago

wsh9352 commented 10 months ago

In `models/matcher.py`, `HungarianMatcher.forward` contains:

out_delta = outputs["pred_deltas"].flatten(0, 1)
out_bbox_old = outputs["pred_boxes_old"].flatten(0, 1)

Predictions from different images in the batch are mixed together by flattening the first two dims, and the deltas are later computed against the flattened ground truth. As a beginner, may I ask whether this is some kind of trick, or whether it could be the cause of the performance drop when a larger batch size is used?

impiga commented 10 months ago

Thanks for your interest!

This part of the code is a bit confusing, but I believe it is correct: the operations on lines 119 and 123 together recover the correct per-image cost matrices. https://github.com/impiga/Plain-DETR/blob/6ad930bb85f5d10417ebe979780132a9a466a8e0/models/matcher.py#L114-L124

I hope the following comments can help.

# Final cost matrix
# Shape: [batch_size * num_queries, sum(num_target_boxes)]
# This matrix contains cost between all predictions and targets in a batch, including those not in the same image.
# The cost between predictions and targets in different images are not used (see below).
C = (
    self.cost_bbox * cost_bbox
    + self.cost_class * cost_class
    + self.cost_giou * cost_giou
)
# Shape: [batch_size, num_queries, sum(num_target_boxes)]
C = C.view(bs, num_queries, -1).cpu()

# Number of target boxes per image; sum(sizes) == sum(num_target_boxes)
sizes = [len(v["boxes"]) for v in targets]
indices = [
    # C.split(sizes, -1) splits C along the target dim into one tensor per image,
    # each of shape [batch_size, num_queries, num_target_boxes_i].
    # Indexing with c[i] then keeps only the i-th image's predictions, so the
    # assignment only sees costs between predictions and targets of the same image.
    linear_sum_assignment(c[i]) for i, c in enumerate(C.split(sizes, -1))
]
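To make this concrete, here is a small standalone sketch (toy shapes and an arbitrary `arange` cost matrix, not the real matcher costs) showing that the `view` + `split` + `c[i]` combination extracts exactly the per-image blocks of the big cost matrix, so cross-image costs are never fed to the Hungarian solver:

```python
import torch
from scipy.optimize import linear_sum_assignment

bs, num_queries = 2, 3
sizes = [2, 1]  # targets per image; sum(sizes) == 3

# Flattened cost matrix as the matcher builds it:
# rows = all predictions in the batch, cols = all targets in the batch.
C = torch.arange(bs * num_queries * sum(sizes), dtype=torch.float32)
C = C.view(bs * num_queries, sum(sizes))

# Reshape so dim 0 indexes images again.
C = C.view(bs, num_queries, -1)

# Split the target dim by image, then pick the matching image index:
# c[i] has shape [num_queries, sizes[i]] -- predictions of image i
# against targets of image i only.
blocks = [c[i] for i, c in enumerate(C.split(sizes, -1))]
indices = [linear_sum_assignment(b) for b in blocks]

for i, (row_ind, col_ind) in enumerate(indices):
    print(f"image {i}: pred indices {row_ind}, target indices {col_ind}")
```

You can check that `blocks[0]` equals `C[0][:, :sizes[0]]` and `blocks[1]` equals `C[1][:, sizes[0]:]`, i.e. the diagonal blocks of the batched cost matrix; the off-diagonal (cross-image) entries are computed but simply never used.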