mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.
Date when the dataset will be open-sourced #5

Closed · ChencongZJU closed this 1 month ago

ChencongZJU commented 1 month ago

Thank you for your excellent work. When will this dataset be open-sourced?

anas-awadalla commented 1 month ago

We released the data earlier today! Check the readme for more details.