snap-research / Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
https://snap-research.github.io/Panda-70M/
438 stars 15 forks source link

Is storage space for clips, or all the videos? #45

Open vedantroy opened 2 months ago

vedantroy commented 2 months ago

Hi, I was wondering if the storage space specified in the README was for the clips, or for all the videos? I've downloaded 67M/70M clips (discarded the videos), but according to rclone, the storage space is only 1.4TB.

vedantroy commented 2 months ago

Update -- quick mistake, I downloaded ~ 65M clips, but the amount of storage used is actually ~ 175TB. I'm guessing this is because I always downloaded best quality.

From reading the paper, did you guys limit downloading to 720p?