Difference between data and data_full

fpv-iplab / rulstm

Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention. International Conference on Computer Vision, 2019.

133 stars 33 forks source link

Hello, thank you for your interest in our work!

All features have been extracted at 30fps. To do so, we first converted all videos to this fixed framerate using the following command:

ffmpeg -i input.mp4 -c:v libx264 -crf 22 -r 30 -vsync cfr -an output.mp4

We extracted features from all frames of the converted videos and stored them into data_full.

To obtain data, we just discarded all frames which were not sampled during training, validation or testing by our method. In practice, we sampled 16 frames at 4fps before the beginning of each action. Please note that this does not correspond to a fixed framerate of 4fps as we align frames to the starting time-stamp of each action.

We provided data to reduce the download size, but I suggest to use data_full if you are implementing your own sampling scheme.

This does not correspond to a uniform framerate of 4fps

fpv-iplab / rulstm

Difference between data and data_full #7