RupertLuo / Valley

The official repository of "Video assistant towards large language model makes everything easy"

jukinmedia data in the new version of valley-65k data cannot be downloaded #32

Open will-wiki opened 7 months ago

will-wiki commented 7 months ago

Hello, thank you very much for open-sourcing the video training data. I followed get_jukinmedia_videourl.py in Valley-Instruct-65k to obtain the video URLs, but the script gets no response. I also saw that someone provided the corresponding video URL links in SUSTech/valley_instruct_65k, but clicking those links shows a no-permission error.

Are there any other available methods to download the data? Thank you very much!

RupertLuo commented 7 months ago

The video links are dynamic. Someone else has also told me that jukinmedia's anti-crawler measures are now very strict, so I have recently been looking into how to crawl these videos again.

will-wiki commented 7 months ago

@RupertLuo Is there any ready-made video data that has already been downloaded?

RupertLuo commented 7 months ago

I have it locally, but it is on a company server, so I cannot upload it anywhere public from there.

will-wiki commented 7 months ago

@RupertLuo Okay, that's a pity. If you find a new way to obtain the data, please let us know. Thank you very much!

pengzhiliang commented 5 months ago

Same issue here: I am unable to download even one video.

GroundMoRe commented 4 months ago

Exactly, the URLs are unavailable. Request error: 403 Client Error: Forbidden for url: https://www.jukinmedia.com/api/public/video/downloadVideo/1160273
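
For anyone who wants to confirm whether every link fails the same way, here is a minimal sketch that groups URLs by the HTTP status the server returns. It uses only the Python standard library and is not taken from get_jukinmedia_videourl.py; the function names `probe` and `summarize` and the injectable `opener` parameter are assumptions made for illustration.

```python
import urllib.error
import urllib.request


def probe(url, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for url, or None if no response arrived.

    `opener` is a hypothetical injection point so the function can be
    exercised without touching the network.
    """
    request = urllib.request.Request(url)
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses (such as 403 Forbidden) are raised as HTTPError.
        return err.code
    except urllib.error.URLError:
        # DNS failure, refused connection, timeout, and similar.
        return None


def summarize(urls, opener=urllib.request.urlopen):
    """Group URLs by the status code the server returned."""
    report = {}
    for url in urls:
        report.setdefault(probe(url, opener=opener), []).append(url)
    return report
```

Calling `summarize` on the list of video URLs returns a dict keyed by status code. If every URL ends up under 403, that points to the server refusing the requests rather than to a bug in the download script.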

wjpoom commented 1 month ago

Hi, has anyone solved this?