clip-vil CLIP-ViL issues

[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383

MIT License

401 stars, 35 forks

issues

Wrong visualization results of ViT-B/16 in gradcam_clip.ipynb

#35 wangq95 opened 10 months ago
0
Pretrained weights for VQA

#34 guanhdrmq opened 1 year ago
0
How to reproduce the results of experiments that are shown in Table 7

#33 raven38 closed 10 months ago
1
Where can I find annotations for SNLI-VE?

#32 1219521375 closed 10 months ago
1
How to infer captions on my own images

#31 victorup closed 10 months ago
1
CVE-2007-4559 Patch

#30 TrellixVulnTeam opened 2 years ago
0
No 'tfm_gen' when trying to run feature extraction for vqa mcan

#29 fallcat closed 2 years ago
1
Pythia Feature Extraction

#28 shamanthak-hegde closed 2 years ago
1
Extracting image features using RN50 for Pythia

#27 shamanthak-hegde closed 2 years ago
0
Checkpoint for SNLI-VE

#26 sramshetty closed 1 year ago
2
About precompute

#25 StylesZhang closed 2 years ago
2
Data dir for mcan_clip_grid_feature.py

#24 Fly2flies closed 2 years ago
1
error in clip extraction code: precomute_imagenet_views.py

#23 wangqian621 closed 2 years ago
3
Errors occurred when extracting clip features using Resnet

#22 tianjunyu0871 closed 2 years ago
1
Errors occurred when extracting clip features using ViT-B/32

#21 tianjunyu0871 closed 2 years ago
1
CLIP-VIT-B-Transformer captioning results

#20 YuanEZhou closed 2 years ago
1
The clip_feature

#19 Timon0327 closed 2 years ago
1
The extracted feature of the COCO dataset for caption

#18 liujiaheng closed 2 years ago
4
The links to download CLIP features on the R2R/RxR environment are invalid.

#17 chenguanqi closed 2 years ago
2
Train with a single GPU

#16 ruinianxu closed 2 years ago
1
About the training time of Pythia

#15 tingxueronghua closed 2 years ago
2
fix resize_pos_embed bug when extracting CLIP features

#14 jianjieluo closed 2 years ago
0
bug in positional_embedding's weights when resizing.

#13 jianjieluo closed 2 years ago
2
Checkpoint for GQA model

#12 aurooj opened 3 years ago
1
Missed Link

#11 jdiazram closed 3 years ago
1
configuration file for CLIP-Res50x4

#10 itsyoavshalev closed 3 years ago
1
evaluating vqa using pythia

#9 itsyoavshalev closed 3 years ago
1
Grad-CAM visualization code

#8 yangbang18 closed 3 years ago
0
Grad-CAM visualization code

#7 yangbang18 closed 3 years ago
1
Pretrained weights for image captioning

#6 zhuang93 closed 3 years ago
1
About clip feature extraction

#5 LittleDonkey1203 closed 3 years ago
1
How to combine CLIP with Oscar (or VinVL)?

#4 594422814 closed 3 years ago
1
Why are the weights in R-50-grid.yaml commented out?

#3 tshu-w closed 3 years ago
0
Captioning model training script fails

#2 j-min closed 3 years ago
11
MS COCO Caption scores with MLE objective

#1 j-min closed 3 years ago
1