issues
search
salesforce
/
ALBEF
Code for ALBEF: a new vision-language pre-training method
BSD 3-Clause "New" or "Revised" License
1.53k
stars
195
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Pretrain phase problem
#45
haoshuai714
closed
2 years ago
11
Problem of NLVR_pretrain.yaml file
#44
haoshuai714
closed
2 years ago
17
A quick question about visual grounding and visualizing Grad-CAM
#43
bellos1203
closed
2 years ago
2
VQA VG answer weight
#42
jayleicn
closed
2 years ago
2
vqa training
#41
Aatrox00
closed
2 years ago
3
About VQA answer_list.json
#40
zengyan-97
closed
2 years ago
2
Configs for pre-training with more than 8 GPUs?
#39
lzongwei
closed
2 years ago
4
About pretraining process
#38
sunanhe
closed
2 years ago
2
After what epoch out of 30 epochs do you select the pre-training model?
#37
cdqncn
closed
2 years ago
2
the variance of loss is very large when initial training.
#36
shoutOutYangJie
closed
2 years ago
1
The max length of tokenizer is 25?
#35
shoutOutYangJie
closed
2 years ago
1
What is the difference between coco.json and coco_train.json
#34
shoutOutYangJie
closed
2 years ago
1
Grad-CAM visualization code
#33
pqviet
closed
2 years ago
2
How to download YFCC100M dataset?
#32
shoutOutYangJie
closed
2 years ago
1
what size of your A100 gpu's memory?
#31
shoutOutYangJie
closed
2 years ago
9
how to split train, val and test dataset of "flickr30k"?
#30
shoutOutYangJie
closed
2 years ago
3
Problems about the test results
#29
idejie
closed
2 years ago
2
Why step_size is set to be 100?
#28
shizhediao
closed
2 years ago
2
Problem in pretraining with SBU
#27
pqviet
closed
2 years ago
4
Got key error when loading weights finetuning on Visual Grounding
#26
pqviet
closed
2 years ago
1
The ability of Pretrained model for downstream tasks use directly
#25
EthanGreen75
closed
2 years ago
2
About memory allocation
#24
tingxueronghua
closed
2 years ago
3
how to test the model?
#23
CQUTWangHong
closed
2 years ago
14
Some difference between the paper and code
#22
cdqncn
closed
2 years ago
2
Questions about Fine-tuned Model Files
#21
cdqncn
closed
2 years ago
2
The number of captions of VG
#20
zdou0830
closed
2 years ago
2
Some results issue
#19
viyjy
closed
2 years ago
4
RuntimeError: invalid multinomial distribution (sum of probabilities <= 0)
#18
cdqncn
closed
2 years ago
5
Is this a typo?
#17
viyjy
closed
2 years ago
2
Would you release the pretrained checkpoint on 4M dataset?
#16
byougert
closed
2 years ago
2
questions about Grounding evaluation
#15
yechenzhi
closed
2 years ago
2
Key Error When reshaping position embedding.
#14
jipson7
closed
2 years ago
2
Cannot load image from CC3M
#13
viyjy
closed
3 years ago
8
Problem about the released checkpoint
#12
RERV
closed
3 years ago
2
Training on a single GPU
#11
tarunn2799
closed
2 years ago
4
results when using resnet?
#10
zhezh
closed
3 years ago
2
meta info about the pre-trained checkpoint
#9
snakeztc
closed
3 years ago
2
Update NLVR.py
#8
lzzk
closed
3 years ago
2
Got key error when loading weights finetuning on MSCOCO-retrieval
#7
wqtwjt1996
closed
3 years ago
3
CUDA Out of Memory
#6
aniketde
closed
3 years ago
3
Training with apex fp16
#5
kugwzk
closed
3 years ago
3
selecting pretraining checkpoints / monitoring pretraining performance
#4
jayleicn
closed
3 years ago
4
the number of training images in pretrained checkpoints
#3
wqtwjt1996
closed
3 years ago
3
pretraining datasets json files
#2
jayleicn
closed
3 years ago
7
Finetuned checkpoint for retrieval on Flickr30k?
#1
crowsonkb
closed
3 years ago
1
Previous