iriscxy / VMSMO

Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''
34 stars 4 forks source link

403 when trying to download videos and images #4

Closed mateuk closed 3 years ago

mateuk commented 3 years ago

Hello, I'm trying to download the dataset. However, I encountered a problem when trying to download anything from some of the domains, mainly miaopai.video.weibocdn.com, which contains most of the links. I keep on getting the following response:

<head><title>403 Forbidden</title></head>
<body bgcolor="white">
<h1>403 Forbidden</h1>
<p>You don't have permission to access the URL on this server.<hr/>Powered by Tengine</body>
</html>

I attach a list of links that worked and list of links which didn't from the dev set.

Any ideas on what could be the reason? Is it possible that some kind of authorization is required?

StudentbyTheSea commented 3 years ago

hello,I have the same problem as you .How do you preprocess the video and image before train ? thanks

iriscxy commented 3 years ago

Hi, we checked the links, and it seems that some of the links have been out of date. So we provide the crawler code in the weibo-crawler folder, by which you can used to collect more new data.