salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

BSD 3-Clause "New" or "Revised" License

1.53k stars, 195 forks

issues

Pretrain phase problem

#45 haoshuai714 closed 2 years ago
11
Problem with the NLVR_pretrain.yaml file

#44 haoshuai714 closed 2 years ago
17
A quick question about visual grounding and visualizing Grad-CAM

#43 bellos1203 closed 2 years ago
2
VQA VG answer weight

#42 jayleicn closed 2 years ago
2
VQA training

#41 Aatrox00 closed 2 years ago
3
About VQA answer_list.json

#40 zengyan-97 closed 2 years ago
2
Configs for pre-training with more than 8 GPUs?

#39 lzongwei closed 2 years ago
4
About pretraining process

#38 sunanhe closed 2 years ago
2
After what epoch out of 30 epochs do you select the pre-training model?

#37 cdqncn closed 2 years ago
2
The variance of the loss is very large at the start of training

#36 shoutOutYangJie closed 2 years ago
1
The max length of tokenizer is 25?

#35 shoutOutYangJie closed 2 years ago
1
What is the difference between coco.json and coco_train.json?

#34 shoutOutYangJie closed 2 years ago
1
Grad-CAM visualization code

#33 pqviet closed 2 years ago
2
How to download the YFCC100M dataset?

#32 shoutOutYangJie closed 2 years ago
1
What is the memory size of your A100 GPUs?

#31 shoutOutYangJie closed 2 years ago
9
How to split the train, val, and test sets of "flickr30k"?

#30 shoutOutYangJie closed 2 years ago
3
Problems about the test results

#29 idejie closed 2 years ago
2
Why is step_size set to 100?

#28 shizhediao closed 2 years ago
2
Problem in pretraining with SBU

#27 pqviet closed 2 years ago
4
Got key error when loading weights for finetuning on Visual Grounding

#26 pqviet closed 2 years ago
1
Using the pretrained model directly for downstream tasks

#25 EthanGreen75 closed 2 years ago
2
About memory allocation

#24 tingxueronghua closed 2 years ago
3
How to test the model?

#23 CQUTWangHong closed 2 years ago
14
Some differences between the paper and the code

#22 cdqncn closed 2 years ago
2
Questions about Fine-tuned Model Files

#21 cdqncn closed 2 years ago
2
The number of captions of VG

#20 zdou0830 closed 2 years ago
2
Some issues with the results

#19 viyjy closed 2 years ago
4
RuntimeError: invalid multinomial distribution (sum of probabilities <= 0)

#18 cdqncn closed 2 years ago
5
Is this a typo?

#17 viyjy closed 2 years ago
2
Would you release the pretrained checkpoint on 4M dataset?

#16 byougert closed 2 years ago
2
Questions about grounding evaluation

#15 yechenzhi closed 2 years ago
2
Key error when reshaping position embedding

#14 jipson7 closed 2 years ago
2
Cannot load image from CC3M

#13 viyjy closed 3 years ago
8
Problem about the released checkpoint

#12 RERV closed 3 years ago
2
Training on a single GPU

#11 tarunn2799 closed 2 years ago
4
Results when using ResNet?

#10 zhezh closed 3 years ago
2
Meta info about the pre-trained checkpoint

#9 snakeztc closed 3 years ago
2
Update NLVR.py

#8 lzzk closed 3 years ago
2
Got key error when loading weights for finetuning on MSCOCO-retrieval

#7 wqtwjt1996 closed 3 years ago
3
CUDA Out of Memory

#6 aniketde closed 3 years ago
3
Training with apex fp16

#5 kugwzk closed 3 years ago
3
Selecting pretraining checkpoints / monitoring pretraining performance

#4 jayleicn closed 3 years ago
4
The number of training images in pretrained checkpoints

#3 wqtwjt1996 closed 3 years ago
3
Pretraining datasets JSON files

#2 jayleicn closed 3 years ago
7
Finetuned checkpoint for retrieval on Flickr30k?

#1 crowsonkb closed 3 years ago
1