iriscxy / VMSMO

Official code and dataset link for ''VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles''
34 stars 4 forks source link

how to get the datasets(most of the urls you offered has been invalid #10

Open zhenzliu opened 2 years ago

zhenzliu commented 2 years ago

hello, most of the URLs you provide in the datasets will report an error 403 whether it is opened directly by the browser or requests.gets.

It seems that these URLs have been invalidated, and most of the data URLs will face this problem.so the download method you provided seems to be invalid.

May I ask you directly provide the source data set to download?

zhenzliu commented 2 years ago

Many URLs seem to be temporary links, and the timestamp has expired。

In addition, I have another question. The cookie in weiboSpider.py you provided is invalid. The comment is "cookie错误或已过期,请按照README中方法重新获取", but it is not mentioned in the README.

HenryJunW commented 2 years ago

Hello, did you successfully get the data? Thanks in advance!

gdg452 commented 1 year ago

Did you successfully get the data? Thanks in advance!