facebookresearch / VMZ

VMZ: Model Zoo for Video Modeling
Apache License 2.0
1.04k stars 157 forks source link

What is the difference between clip1 accuracy and video accuracy? #23

Closed wenjie710 closed 6 years ago

wenjie710 commented 6 years ago

Table 4 in the paper shows the clip1 accuracy and video1 accuracy of different methods, but I am confused about the difference between this two accuracies. It seems that the video1 accuracy is higer than clip1 accuracy.

murilovarges commented 6 years ago

Hi @wenjie710,

A video is composed of many clips (e.g. each clip with 32 frames), so clip1 accuracy is the rate of correct classification per clip. While Video accuracy is the rate of correct classification of a video after doing dense prediction aggregation "dense_prediction_aggregation.py" using all clips of a given video.

dutran commented 6 years ago

Thank @murilovarges for helping. As murilovarges mentioned, you can use dense prediction to compute the video-level accuracy. Alternatively, depend on you GPU memory, some small models or model trained on short-clips (8-frame or 16-frame clips), you can use test_net.py to test which can aggregate prediction from 10-clips per video.

JinyangGuo commented 5 years ago

Hi,

For the clip level accuracy, what if the last clip is not long enough? For example, we can split the video into several 16-frame clips. But the last clip is shorter than 16 frame. How can we handle this clip?