rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
MIT License
3.74k stars 341 forks source link

Question about LAION-400M #418

Open BIGBALLON opened 8 months ago

BIGBALLON commented 8 months ago

From https://laion.ai/blog/laion-5b/ , we can get the average caption length for laion-2B dataset, image

Are there any Dataset Statistics for LAION-400M, especially average caption length?