vision-language-transformer Search Results

1000+ results
for vision-language-transformer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

helblazer811/ManimML #11

Transformer Visualization

I want to make visualization systems for visualizing transformers, specifically self-attention. It would be nice if it worked for Vision Transformers as well as Language Models.

helblazer811 updated 1 year ago
2
huggingface/transformers #32435

[i18n-ar] Translating docs to Arabic (العربية)

Hi! !مرحبا! السلام عليكم Let's bring the documentation to all the Arabic-speaking community 🌏 (currently 0 out of 267 complete) Would you want to translate? Please follow the 🤗 [TRANSLATING guid…

AhmedAlmaghz updated 1 day ago
2
magic-research/PLLaVA #20

Issue with multi-GPU inference

I tried to run the demo on multiple RTX 3090 but got strange errors: ``` python3.10/site-packages/transformers/cache_utils.py", line 146, in update self.key_cache[layer_idx] = torch.cat([self.k…

AmitRozner updated 1 month ago
4
MDK8888/GPTFast #16

Possible to use with a VL model like LLAVA?

I am trying to use this project with a vision-language model like https://huggingface.co/docs/transformers/en/model_doc/llava_next but currently this repo does not support vision part of the model. I …

aliencaocao updated 6 months ago
2
e4exp/paper_manager_abstract #287

Perspectives and Prospects on Transformer Architecture for C…

- https://arxiv.org/abs/2103.04037 - 2021 トランスフォーマーアーキテクチャは、長年リカレントニューラルネットワークに支配されていた計算言語学の分野に根本的な変化をもたらしました。その成功は、言語と視覚のクロスモーダルなタスクにも劇的な変化をもたらし、多くの研究者がすでにこの問題に取り組んでいます。本論文では、この分野における最も重要なマイル…

e4exp updated 3 years ago
7
Project-MONAI/MONAI #7781

AttributeError: type object 'obj' has no attribute '_attn_im…

``` ====================================================================== ERROR: test_shape_0 (tests.test_transchex.TestTranschex) -----------------------------------------------------------------…

KumoLiu updated 4 months ago
1
mbzuai-oryx/GeoChat #19

Inference error

``` [2024-03-20 16:15:45,873] [INFO] [real_accelerator.py:110:get_accelerator] Setting ds_accelerator to cuda (auto detect) config.json: 100%|████████████████████████████████████████████████████████…

ZiruiSongBest updated 6 months ago
3
cambrian-mllm/cambrian #12

【bug】can not load cambrian-34b

in load_pretrained_model model = CambrianLlamaForCausalLM.from_pretrained( File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 3531, in from_pretrained ) =…

CSEEduanyu updated 2 months ago
17
2dot71mily/youtube_captions_corrections #2

Model checkpoints available and suggestions on language mode…

Hi! I am exploring sentence transformers for a visual scene detection application, to correct automated close captioning according to what is found in the analyzed video frame. For example, if the vid…

pablogranolabar updated 2 years ago
1
YoojLee/paper_review #71

BLIP-2: Bootstrapping Language-Image Pre-training with Froze…

# Summary 기존의 VLP는 from scratch로 학습을 시켰지만, 이는 pre-training cost가 너무 크며 기존에 잘 학습되었던 모델 (특히, LLM)에 대한 활용이 어려움. 따라서, frozen vision encoder와 frozen llm을 Q-Former (Querying Transformer)를 통해 잘 이어보는 방식으…

YoojLee updated 7 months ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for vision-language-transformer

1000+ results
for vision-language-transformer