dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Apache License 2.0
3.22k stars 280 forks source link

Can provide laion-gpt4v dataset images zip? #35

Open lucasjinreal opened 7 months ago

lucasjinreal commented 7 months ago

Hi, after downloaded the laion-gpt4v images, I got only: 11686 images, am using json index order as image name, to avoid the bias between dataset, and possibly wrong annotation to image, just to be sure, does the last image index is: image

?

lucasjinreal commented 7 months ago

https://imt.boatwizard.com/images/1/48/29/4754829_20160406125957360_1_XLARGE.jpg broken 8411

yanwei-li commented 7 months ago

Hi, actually, we only download about 10K data in total for LAION-GPT-4V dataset without broken. So, I cannot ensure the last image index in this case.

lucasjinreal commented 7 months ago

I found there still some url broken but inside minigemini, is that possible have a imags.zip from your side? (This dataset is really lack of source and many links could broken at anytime)

lucasjinreal commented 7 months ago

Hello, is there any chance you could assist me in sharing the gpt4v-datasets from Laion? Some of the images appear to be broken and cannot be downloaded. If possible, a zip file of images from Hugging Face would be greatly appreciated. Could I trouble you for some assistance? @yanwei-li