google-research-datasets / conceptual-captions

Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.
Other
516 stars 26 forks source link

Any code for downloading the dataset? #1

Closed wangxinyu0922 closed 6 years ago

wangxinyu0922 commented 6 years ago

Thank you for your great work. Can you provide any example code for download images by urls from the tsv files? Some url can not be downloaded by urllib in python (IOError: [Errno socket error] [Errno 110] Connection timed out). But I can see the images in the browser.

dingnan-google commented 6 years ago

Thank you for your interest! Unfortunately, we cannot provide the code for downloading images by urls from the tsv files because of copyright/legal issues.