Closed yuzheyao22 closed 9 months ago
If I print the "vs"(verb_scores) in hoi.py, logits for all the 117 verbs are the same. It means that the model does not predict the verb, right?
For example when printing the _outputs['pred_verblogits'][0][0]
tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.], device='cuda:2')
Yes, the provided code output interactiveness predictions, which are used to apply non-interaction suppression (NIS) on verb classification results. Here "pred_verb_logits" is omitted.
If I print the "vs"(verb_scores) in hoi.py, logits for all the 117 verbs are the same. It means that the model does not predict the verb, right?