mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.
730 stars, 20 forks