Closed crodriguezo closed 3 years ago
It only needs to follow the corresponding start and end frames. The st_time and ed_time are marked manually, some frames may not contain the target person, which is slightly adjusted in the st_frame and ed_frame during the bounding boxes annotation.
Thank you
Hi,
I want to get some clarifications in the annotations
HCVG_train.json
. I played with the annotations to double-check the bounding boxes and the temporal, and I found an inconsitency between time and frame. Let me explain with few samples.Annotation
If we compute the frame using the fps of the video and the time, we got the following values:
Then, if we compute the frame rate for each point of the moment, we can see a considerable difference.
Another example
Annotation
Computation
I can found that inconsistency over every video. I wonder if it is related to the spatial annotations. Do you have any recommendation on how to deal with this for the evaluation?