vision-language-transformer Search Results

1000+ results
for vision-language-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

shenyunhang/APE #26

无法正确load配置文件；Can't load config file correctly

你好，我根据readme提供的路径下载了APE-D模型与配置文件运行脚本如下： `python demo/demo_lazy.py \ --config-file configs/LVISCOCOCOCOSTUFF_O365_OID_VGR_SA1B_REFCOCO_GQA_PhraseCut_Flickr30k/ape_deta/ape_deta_vitl_eva02_clip_vlf_…

CallMeFrozenBanana updated 10 months ago
4
2dot71mily/youtube_captions_corrections #2

Model checkpoints available and suggestions on language mode…

Hi! I am exploring sentence transformers for a visual scene detection application, to correct automated close captioning according to what is found in the analyzed video frame. For example, if the vid…

pablogranolabar updated 2 years ago
1
MDK8888/GPTFast #16

Possible to use with a VL model like LLAVA?

I am trying to use this project with a vision-language model like https://huggingface.co/docs/transformers/en/model_doc/llava_next but currently this repo does not support vision part of the model. I …

aliencaocao updated 7 months ago
2
huggingface/transformers #32435

[i18n-ar] Translating docs to Arabic (العربية)

Hi! !مرحبا! السلام عليكم Let's bring the documentation to all the Arabic-speaking community 🌏 (currently 0 out of 267 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guid…

AhmedAlmaghz updated 1 month ago
2
huggingface/transformers #33374

Track progress for VLMs refactoring

This issue tracks the progress on improving the handling and testing of Vision-Language Models. The main goals are to enhance/enable generation tests, handle other generation techniques like assisted …

zucchini-nlp updated 4 weeks ago
1
zer0int/CLIP-fine-tune #16

I want to fine-tune a complete text encoder model, but it se…

First of all, thank you for your work. I have a question for you. I want to fine-tune a complete text encoder model, but it seems that the model trained by ft-B-train-OpenAI-CLIP-ViT-L-14.py is a vis…

vxiaobai updated 2 weeks ago
8
helblazer811/ManimML #11

Transformer Visualization

I want to make visualization systems for visualizing transformers, specifically self-attention. It would be nice if it worked for Vision Transformers as well as Language Models.

helblazer811 updated 1 year ago
2
yyf17/NavigationProject #8

CVPR 2022

CVPR 2022 # 格式 * **Paper Title** *Author(s)* CVPR, 2022. [[Paper]](link) [[Code]](link) [[Website]](link) 需要填充： 1）Paper Title 2） Author(s) 3） 3个“link” 4）两篇文章之间间隔一行 # agent Meta Ag…

yyf17 updated 2 years ago
1
irthomasthomas/undecidability #722

MoAI/README.md at master · ByungKwanLee/MoAI

- [ ] [MoAI/README.md at master · ByungKwanLee/MoAI](https://github.com/ByungKwanLee/MoAI/blob/master/README.md?plain=1) # MoAI/README.md at master · ByungKwanLee/MoAI ## Description ![MoAI: Mixture…

irthomasthomas updated 2 months ago
1
zer0int/CLIP-fine-tune #4

PEFT fine tune CLIP VIT-G?

Hello again! Would it be possible to modify the GMP fine tune script to train a LoRA with PEFT for the CLIP VIT-G model? Then merge the LoRA with the model to get a new CLIP-G model? Chat-GPT se…

bash-j updated 5 months ago
3

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for vision-language-transformer

1000+ results
for vision-language-transformer