byteresearchcla / RealSI

RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
https://byteresearchcla.github.io/clasi/
Creative Commons Attribution 4.0 International
45 stars 4 forks source link

Can there be other download methods? Or upload downloaded audio data sets. #3

Open geekchen007 opened 3 days ago

geekchen007 commented 3 days ago

Data download failed. Can there be other download methods? Or upload downloaded audio data sets.

<urlopen error [Errno 111] Connection refused>. Retrying in 5 seconds...

byteresearchcla commented 2 days ago

Hi @geekchen007 , thank you for pointing out. It was an issue caused by pytube. We are trying to use other open-source tools and will update it soon.

byteresearchcla commented 1 day ago

@geekchen007 We don't own these data, so we can only provide download scripts. We did some test and it's already fixed. Do let us know if any further questions.

geekchen007 commented 12 hours ago

Thanks to the development team for their quick response. The new plan is more promising, but there are some minor problems.

Download Error: ERROR: [youtube] _K-eupuDVEc: Sign in to confirm you’re not a bot. Use --cookies-from-browser or --cookies for the authentication. See https://github.com/yt-dlp/yt-dlp/wiki/FAQ#how-do-i-pass-cookies-to-yt-dlp for how to manually pass cookies. Also see https://github.com/yt-dlp/yt-dlp/wiki/Extractors#exporting-youtube-cookies for tips on effectively exporting YouTube cookies