issues
search
mlfoundations
/
MINT-1T
MINT-1T: A one trillion token multimodal interleaved dataset.
779
stars
20
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Error when loading MINT-1T-PDF-2023-06
#13
hancheolcho
opened
1 month ago
4
How to load the dataset?
#12
L-hongbin
closed
2 months ago
2
would you share your data processing code
#11
MarStarck
closed
2 months ago
2
Why is MINT data interleaved?
#10
nhsjgczryf
closed
3 months ago
1
How to align image data in json file with tiff image?
#9
chenyehuang
closed
3 months ago
5
Add files via upload
#8
anas-awadalla
closed
4 months ago
0
This is truly a huge amount of data.
#7
limhasic
closed
4 months ago
2
LICENSE?
#6
brianjking
closed
4 months ago
1
Date when the dataset will be open-sourced
#5
ChencongZJU
closed
4 months ago
1
Why is OBELICS generally better than MINT-1T (HTML)?
#4
lijinginfo
closed
2 months ago
2
chore: update README.md
#3
eltociear
closed
5 months ago
0
Related Work
#2
TobiasLee
closed
5 months ago
3
V1 release
#1
anas-awadalla
closed
5 months ago
0