issues
search
allenai
/
mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
MIT License
904
stars
34
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
update google form link for requesting full dataset
#23
jmhessel
closed
6 months ago
0
Accessing MMC4-core
#22
wusize
closed
6 months ago
2
Access to discarded higher aspect ratio images
#21
ganyeshprasanna
opened
7 months ago
0
Dataset available on Huggingface?
#20
snat-s
opened
10 months ago
0
Image downloading
#19
Moonteresa
opened
1 year ago
0
fix: fix local download function with no args
#18
Chlience
closed
1 year ago
1
download_images.py use local() with no args
#17
Chlience
closed
1 year ago
0
some shards cannot be accessed with 404 error
#16
TobiasLee
opened
1 year ago
2
The data performance is inconsistent with the paper
#15
drunkpig
closed
1 year ago
4
Missing or broken images (due to stale URLs)
#14
pfischer-nvidia
opened
1 year ago
8
update the corpus to v1.1
#13
jmhessel
closed
1 year ago
0
CLIP ViT-L/14 weights
#12
josep-alana
closed
1 year ago
10
Multiple images for an identical matched_text_index
#11
fauconnier
closed
1 year ago
9
Duplicates and multiple versions of samples
#10
pfischer-nvidia
closed
1 year ago
5
Add the option of downloading images from the provided links.
#9
sramshetty
closed
1 year ago
0
Add image downloading script
#8
VegB
closed
1 year ago
0
Is there a quick way to download raw images?
#7
PhoebusSi
closed
1 year ago
2
A subset of image features for mmc4 core?
#6
HenryHZY
closed
9 months ago
2
Any recommended code for converting mmc4 into WebDataset format instead of jsonl format?
#5
roboswell
closed
1 year ago
8
Any plan to release the data processing code?
#4
duzx16
closed
1 year ago
8
add download script for fewer_facev2 and fewer_face_corev3
#3
Luodian
closed
1 year ago
4
feat: add all shards download & unzip script
#2
Luodian
closed
1 year ago
2
Some links are Unavaliable. They are:
#1
PhoebusSi
closed
1 year ago
3