Hi, when I test the result, I found that even though other parts is pretty fast, the nms tims cost is pretty high.
In this case, I test the time cost step by step and found that inds = torch.nonzero(scores[:,j]>0.01).view(-1) is super time consuming. It will takes nearly 50ms per iteration in k40c.
Hi, when I test the result, I found that even though other parts is pretty fast, the nms tims cost is pretty high.
In this case, I test the time cost step by step and found that
inds = torch.nonzero(scores[:,j]>0.01).view(-1)
is super time consuming. It will takes nearly 50ms per iteration in k40c.Does anyone has any ideas about that?