antoyang / VidChapters

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
http://arxiv.org/abs/2309.13952
MIT License
174 stars 21 forks source link

Video request #15

Open ustcwhy opened 11 months ago

ustcwhy commented 11 months ago

Thanks for your wonderful work!

I would like to download videos of VidChapters. Could you provide some cmd tools to quickly download these videos? And how much storage do you use for these videos?

Thanks~

antoyang commented 11 months ago

I suggest using yt-dlp for downloading the videos, e.g. yt-dlp -o "". For the storage, it depends what options you choose in terms of resolution / whether you download audio or not.

ustcwhy commented 11 months ago

Thanks! Could you provide your cmd for downloading these videos and the audio part, especially the args for video resolution and the audio quality?

antoyang commented 11 months ago

See https://github.com/yt-dlp/yt-dlp for details, IIRC I just used the flag res to choose a resolution like 512 or 448 pixels (anything higher than 224 should be fine).

ustcwhy commented 10 months ago

I try to use yt-dlp to download these videos. However, it seems to be very slow. My cmd is "yt-dlp -a video_url.txt -f 'bestvideo[height<=480]+bestaudio/best[height<=480]' " I also tried video2dataset. But it is easy to be blocked due to too many requests. Could you provide some advice to quickly download the src videos...