Closed adithyaiyer1999 closed 1 month ago
The training dataset is approximately 145M images with resolution higher than 1024, of which 140M come from laion-highresolution and can be directly accessed from Hugging Face. The remaining part has no plan to release, but I believe this part is not important for training.
Thanks! Makes sense.
Hi!
Thanks for your great work. I had 2 questions regarding the dataset you trained on.
Thanks again! Adi