Closed nizhf closed 2 years ago
Hi @nizhf thanks for your interest in our work!
For training, no matter it's in Oracle or Detection mode we use only ground truth person trajectories and we consider all possible pairs. So let's say there're n person box in a sampled frame then we consider n(n-1) pairs. Kindly also refer to the following code snippet:
Thank you for the explanation. That means for a human-object pair without ground-truth interaction annotation, the label is set to a 50-d zero-vector, is that correct?
Thanks for your great work. I have a question to your training and evaluation process. How do you deal with the human-object pairs that are not annotated in the ground-truth when you train and evaluate the model in Oracle mode? Do you only use the ground-truth pairs as input, or you consider all possible pairs? Thank you.