issues
search
clip-vil
/
CLIP-ViL
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
MIT License
401
stars
35
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Wrong visualization results of ViT-B/16 in gradcam_clip.ipynb
#35
wangq95
opened
10 months ago
0
pretrained weigths for VQA
#34
guanhdrmq
opened
1 year ago
0
How to reproduce the results of experiments that are shown in Table 7
#33
raven38
closed
10 months ago
1
Where can I found annotations for SNLI-VE?
#32
1219521375
closed
10 months ago
1
How to infer captions on my own images
#31
victorup
closed
10 months ago
1
CVE-2007-4559 Patch
#30
TrellixVulnTeam
opened
2 years ago
0
No 'tfm_gen' when trying to run feature extraction for vqa mcan
#29
fallcat
closed
2 years ago
1
Pythia Feature Extraction
#28
shamanthak-hegde
closed
2 years ago
1
Extracting image features using RN50 for Pythia
#27
shamanthak-hegde
closed
2 years ago
0
Checkpoint for SNLI-VE
#26
sramshetty
closed
1 year ago
2
About precompute
#25
StylesZhang
closed
2 years ago
2
Data dir for mcan_clip_grid_feature.py
#24
Fly2flies
closed
2 years ago
1
error in clip extraction code: precomute_imagenet_views.py
#23
wangqian621
closed
2 years ago
3
Errors occurred when extracting clip features using Resnet
#22
tianjunyu0871
closed
2 years ago
1
Errors occurred when extracting clip features using ViT-B/32
#21
tianjunyu0871
closed
2 years ago
1
CLIP-VIT-B-Transformer captioning results
#20
YuanEZhou
closed
2 years ago
1
The clip_feature
#19
Timon0327
closed
2 years ago
1
The extracted feature of the COCO dataset for caption
#18
liujiaheng
closed
2 years ago
4
The links to download CLIP features on the R2R/RxR environment are invalid.
#17
chenguanqi
closed
2 years ago
2
Train with a single GPU
#16
ruinianxu
closed
2 years ago
1
About the training time of Pythia
#15
tingxueronghua
closed
2 years ago
2
fix resize_pos_embed bug when extracting CLIP features
#14
jianjieluo
closed
2 years ago
0
bug in positional_embedding's weights when resizing.
#13
jianjieluo
closed
2 years ago
2
Checkpoint for GQA model
#12
aurooj
opened
3 years ago
1
Missed Link
#11
jdiazram
closed
3 years ago
1
configuration file for CLIP-Res50x4
#10
itsyoavshalev
closed
3 years ago
1
evaluating vqa using pythia
#9
itsyoavshalev
closed
3 years ago
1
Grad-CAM visualization code
#8
yangbang18
closed
3 years ago
0
Grad-CAM visualization code
#7
yangbang18
closed
3 years ago
1
Pretrained weights for image captioning
#6
zhuang93
closed
3 years ago
1
About clip feature extraction
#5
LittleDonkey1203
closed
3 years ago
1
How to combine CLIP with Oscacr(or VinVL)?
#4
594422814
closed
3 years ago
1
Why weights of R-50-grid.yaml is commented out?
#3
tshu-w
closed
3 years ago
0
Captioning model training script fails
#2
j-min
closed
3 years ago
11
MS COCO Caption scores with MLE objective
#1
j-min
closed
3 years ago
1