zengyan-97 X-VLM issues

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

BSD 3-Clause "New" or "Revised" License

442 stars 51 forks

issues

The Little Daisy Bake Shop - New Website

#34 madelinedefrank closed 1 month ago
0
An error in `Retrieval.py`

#33 jiajinuiuc opened 1 year ago
0
The torch version is out of date

#32 Hoang-it opened 1 year ago
0
Will data leakage happen for bounding box prediction?

#31 1049451037 opened 1 year ago
0
About training data

#30 1049451037 closed 1 year ago
1
Loading from pretrained warning

#29 lezhang7 opened 1 year ago
0
Where is the pretrained model's config file?

#28 lezhang7 opened 1 year ago
0
Dear Author, is there any inference code for image retrieval? How can I use this project to run inference on my own image-text pairs?

#27 wildwolff opened 1 year ago
1
apply an entire BERT as text encoder

#26 lxianl455 opened 1 year ago
1
VQA: Limitations in questions and answers

#25 fizahkhalid opened 1 year ago
1
VQA: Understanding how the model provides us an answer? Need of answer list?

#24 fizahkhalid opened 1 year ago
7
Code for Grad-CAM visualization

#23 qiaomu-miao opened 1 year ago
2
The code saves the best testing results on Image-Text Retrieval

#22 yangbang18 opened 1 year ago
0
NLVR Pretrain

#21 lonestar234028 closed 1 year ago
1
Finetuning On NLVR2

#20 lonestar234028 closed 1 year ago
1
About batch sampling `iter_perc`

#19 yangbang18 closed 1 year ago
1
Performance of different vision encoders

#18 AI-in-Health closed 1 year ago
1
Fine-tune on VQA

#17 darwann closed 1 year ago
2
About swin_B_480

#16 Sxx1995 closed 1 year ago
1
Inference API for referring expression comprehension

#15 zzh-tech opened 2 years ago
0
add web demo/models/datasets to ICML organization on Hugging Face

#14 AK391 opened 2 years ago
0
Script to generate RegionTextJsonDataset?

#13 daizuozhuo closed 1 year ago
4
Training log for the pretrain stage

#12 tgxs002 closed 2 years ago
2
Drawing Attention Heatmap

#11 TheodorPatrickZ opened 2 years ago
1
Fine-tuning

#10 TheodorPatrickZ closed 2 years ago
1
pretrain-base-4m for the X_VLM

#9 wfx0330 closed 2 years ago
1
About license

#8 WangWenhao0716 closed 2 years ago
2
Distributed mode for single GPU

#7 TheodorPatrickZ closed 2 years ago
2
Could you provide your training logs for COCO captioning? Thank you very much!

#6 pypypypy666 closed 2 years ago
1
Custom image inference

#5 SangMyeongWoh opened 2 years ago
0
Hi, could you provide the specific commands for fine-tuning on COCO captioning? Thanks!

#4 yaolinli closed 2 years ago
1
Hello, the train_files setting in the configs/yaml file, "hdfs://path/to/vg", causes an error. How should it be set up?

#3 zhanghehe8 closed 2 years ago
6
Great project, really looking forward to the release of the code!

#2 HenryHZY closed 2 years ago
1
When will this be open-sourced?

#1 cdqncn closed 2 years ago
5