mlfoundations / MINT-1T

MINT-1T: A one-trillion-token multimodal interleaved dataset.
779 stars, 20 forks