neu-vi / SportsSloMo

SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876)
https://neu-vi.github.io/SportsSlomo/

Number of clips? #7

Open JHLew opened 11 months ago

JHLew commented 11 months ago

Hi, thank you for sharing your work with the public. I have tried downloading and processing the SportsSloMo dataset. It seems to consist of 8498 clips, whereas the arXiv version of the paper claims 130K clips. According to the processing code in the repo, I believe 8498 is the correct number, but I couldn't find that number in the paper, so I was wondering if I am missing something here.

Is there a difference in how the clips are counted, or is this only a small portion of the full dataset?

playerkk commented 11 months ago

Thank you for your interest in our work.

The "clips" in the downloaded file, which should have been named "sequences", are different from the clips described in the paper. Here, each sequence contains slow-motion content of a single scene without any shot change. It may contain more than 9 frames, which can be useful for other tasks like dynamic view synthesis. A clip as defined in the paper is just a subset of a sequence.

Please refer to the data processing code and the data loader for how to use these long sequences for video frame interpolation.
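
For readers who want a rough picture of how one long sequence can yield several clips, here is a minimal sketch. It is not the repo's data loader: the function name `split_into_clips`, the 9-frame clip length, and the stride values are assumptions chosen for illustration, so the counts it produces will not necessarily match the numbers in the paper.

```python
def split_into_clips(frames, clip_len=9, stride=9):
    """Cut one sequence (a list of frames) into fixed-length clips.

    clip_len and stride are illustrative assumptions, not values
    taken from the SportsSloMo processing code.
    """
    if clip_len <= 0 or stride <= 0:
        raise ValueError("clip_len and stride must be positive")
    # Keep only full windows; a trailing remainder shorter than clip_len is dropped.
    last_start = len(frames) - clip_len
    return [frames[i:i + clip_len] for i in range(0, last_start + 1, stride)]


# A hypothetical 30-frame sequence without any shot change.
sequence = [f"frame_{i:04d}.png" for i in range(30)]

non_overlapping = split_into_clips(sequence)           # windows start at 0, 9, 18
overlapping = split_into_clips(sequence, stride=1)     # windows start at 0..21

print(len(non_overlapping), len(overlapping))
```

Under these assumptions a single sequence produces multiple clips, and the number depends strongly on the stride, which is one way the sequence count and the clip count can differ by an order of magnitude.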