CNVid / CNVid-3.5M

This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale Public Chinese Video-text Dataset>, which has been accepted by CVPR 2023.
Creative Commons Attribution 4.0 International
21 stars 0 forks source link

Unable to download videos. #1

Open lemoner20 opened 1 year ago

lemoner20 commented 1 year ago

It seems that the video urls provided in cn_vid3_5m_v1.csv are not downloadable. Is it possible to provide a dataset with readable addresses instead?

dongxingning commented 1 year ago

Thanks for your question. We have noticed that a small fraction of video urls were deleted by their users or platform. We will count the number of currently downloadable videos, and if the volume of missing videos is non-negligible, we will add some new videos to ensure the total number is 3.5M.

mingxing113 commented 1 year ago

Thanks for your work, but why now still not downloadable?

haodongze commented 1 year ago

the video urls provided in cn_vid3_5m_v1.csv 不能直接下载视频,是需要爬虫吗

dongxingning commented 1 year ago

Well, you a need web crawler to download the source videos. We provide the download script, which could be found at [download_cnvid_video.py]. And you need to change the url_head in Line 51. You could follow [https://blog.csdn.net/gjgfjgy/article/details/109437381] for detailed tutorials to find current url_head. Hope it will be helpful to you.

geekchen007 commented 1 year ago

Well, you a need web crawler to download the source videos. We provide the download script, which could be found at [download_cnvid_video.py]. And you need to change the url_head in Line 51. You could follow [https://blog.csdn.net/gjgfjgy/article/details/109437381] for detailed tutorials to find current url_head. Hope it will be helpful to you.

I made some attempts, but couldn't find the answer. May I ask if anyone is available? Share the website or tutorial

UestcJay commented 5 months ago

Well, you a need web crawler to download the source videos. We provide the download script, which could be found at [download_cnvid_video.py]. And you need to change the url_head in Line 51. You could follow [https://blog.csdn.net/gjgfjgy/article/details/109437381] for detailed tutorials to find current url_head. Hope it will be helpful to you.

How to download videos in linux?