Closed by PhoebusSi 1 year ago
Hi @PhoebusSi! At the moment, for legal reasons, we cannot provide bulk access to the raw image files. However, 1) we are looking into options, because we understand this makes the dataset harder to use; and 2) we hope to soon provide the multi-threaded download script that we used to gather many images in a short time.
[ edit: deferring details to @vegb who knows more than me about this step! ]
Added, thanks to @vegb https://github.com/allenai/mmc4/blob/main/scripts/download_images.py !
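For anyone who wants a rough idea of what multi-threaded downloading looks like before opening the script, here is a minimal sketch. It is not the mmc4 `download_images.py` linked above; the names `fetch_url` and `download_all`, the default of 16 workers, and the 10-second timeout are all assumptions made for illustration, and it uses only the Python standard library.

```python
import urllib.request
from concurrent.futures import ThreadPoolExecutor, as_completed


def fetch_url(url, timeout=10):
    """Return the raw bytes served at `url` (hypothetical helper, stdlib only)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_all(urls, fetch=fetch_url, max_workers=16):
    """Fetch every URL on a thread pool.

    Returns (results, errors): `results` maps url -> bytes and
    `errors` maps url -> the exception raised while fetching it.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit all downloads up front; the pool caps how many run at once.
        futures = {pool.submit(fetch, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:  # a dead link should not stop the batch
                errors[url] = exc
    return results, errors
```

Threads help here because each download spends most of its time waiting on the network. Calling `download_all(list_of_image_urls)` returns the fetched bytes keyed by URL, plus a separate dictionary of failures so that dead links can be retried or skipped.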
Downloading images directly from various websites is too slow. Do you have any packaged image files available?