allenai / mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
MIT License
901 stars 34 forks source link

update the corpus to v1.1 #13

Closed jmhessel closed 1 year ago

jmhessel commented 1 year ago

update to v1.1 of the corpus