-
### 🚀 The feature, motivation and pitch
I'm working on applications that must run locally on resource-limited hardware. Therefore, quantization becomes essential. Such applications need multimodal vi…
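For concreteness, the kind of quantization meant here can be sketched with PyTorch post-training dynamic quantization; this is a generic example with made-up layer sizes, not tied to any particular multimodal model:

```python
# Minimal sketch: post-training dynamic quantization in PyTorch.
# The model below is a stand-in; layer sizes are illustrative assumptions.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))

# Quantize Linear weights to int8; activations are quantized dynamically at runtime.
quantized = torch.quantization.quantize_dynamic(model, {nn.Linear}, dtype=torch.qint8)

x = torch.randn(1, 512)
print(quantized(x).shape)  # torch.Size([1, 10])
```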
-
Hi Blaz,
First off, thank you for the open-access framework. I have already tested some of the architectures on test data and they produce great results. I was wondering whether I could pick your brain regard…
-
Hi,
I am looking into your code, but it seems that in `models.py`, `self.multi_head_att_layers` (self-attention) and `self.relation_attention_gcns` (cross-KG attention) use the same adjacency mat…
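To make the question concrete, here is a toy sketch of what passing a distinct adjacency matrix to each stage might look like; everything except the two layer names from the issue is hypothetical:

```python
# Hypothetical sketch: threading two different adjacency matrices through
# two attention stages instead of reusing one. Not the repo's actual code.
import torch

def gcn_layer(h, adj, weight):
    # Simple GCN propagation: aggregate neighbor features given an adjacency matrix.
    return torch.relu(adj @ h @ weight)

n, d = 5, 8
h = torch.randn(n, d)
adj_self = torch.eye(n)                           # intra-KG structure (assumed)
adj_cross = torch.randint(0, 2, (n, n)).float()   # cross-KG alignment (assumed)
w1, w2 = torch.randn(d, d), torch.randn(d, d)

h = gcn_layer(h, adj_self, w1)    # would correspond to self.multi_head_att_layers
h = gcn_layer(h, adj_cross, w2)   # would correspond to self.relation_attention_gcns
```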
-
[Scalable Diffusion Models with Transformers](https://arxiv.org/pdf/2212.09748)
Given the remarkable achievements of Google AlphaFold 3, which also uses DiT, a combination of diffusion models and Transformers…
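For readers new to DiT, a minimal sketch of the core idea from the linked paper: a Transformer block whose normalization is modulated by the diffusion timestep embedding (adaLN). Dimensions and module names here are my own illustration, not the paper's reference code:

```python
# Simplified DiT-style block: LayerNorm scale/shift (and residual gates) are
# predicted from the conditioning vector (timestep embedding).
import torch
import torch.nn as nn

class DiTBlock(nn.Module):
    def __init__(self, dim: int, n_heads: int):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim, elementwise_affine=False)
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim, elementwise_affine=False)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
        self.adaln = nn.Linear(dim, 6 * dim)  # per-block shift/scale/gate pairs

    def forward(self, x, cond):
        # cond: (batch, dim) timestep (+ class) embedding.
        s1, sc1, g1, s2, sc2, g2 = self.adaln(cond).chunk(6, dim=-1)
        h = self.norm1(x) * (1 + sc1.unsqueeze(1)) + s1.unsqueeze(1)
        x = x + g1.unsqueeze(1) * self.attn(h, h, h, need_weights=False)[0]
        h = self.norm2(x) * (1 + sc2.unsqueeze(1)) + s2.unsqueeze(1)
        return x + g2.unsqueeze(1) * self.mlp(h)

tokens = torch.randn(2, 16, 64)   # (batch, patch tokens, dim) of noised latents
t_emb = torch.randn(2, 64)        # timestep embedding
print(DiTBlock(64, 4)(tokens, t_emb).shape)  # torch.Size([2, 16, 64])
```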
-
Hi there. Thanks for the great library!
I have one issue regarding the usage of BERT-based models. I trained different models by fine-tuning them on my custom dataset (roberta, luke, deberta, xlm-rober…
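Not knowing the library's exact loading code, the generic Hugging Face pattern for loading such fine-tuned checkpoints looks like this; `path/to/checkpoint` is a placeholder, not a path from this issue:

```python
# Generic example: load a fine-tuned BERT-family checkpoint with transformers.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("path/to/checkpoint")
model = AutoModelForSequenceClassification.from_pretrained("path/to/checkpoint")

inputs = tokenizer("An example sentence.", return_tensors="pt")
logits = model(**inputs).logits
```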
-
```python
# Multi-head self-attention output (`tf.keras.layers.MultiHeadAttention`).
attn_output = self.mha(
    query=x,  # Query Q tensor.
    value=x,  # Value V tensor.
    key=x,    # Key K tensor.
)
```
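If it helps to run the snippet outside its class, here is a self-contained version; the layer configuration and tensor shapes are illustrative assumptions:

```python
import tensorflow as tf

mha = tf.keras.layers.MultiHeadAttention(num_heads=4, key_dim=16)
x = tf.random.normal((2, 10, 64))           # (batch, sequence, features)
attn_output = mha(query=x, value=x, key=x)  # self-attention: Q = K = V = x
print(attn_output.shape)                    # (2, 10, 64)
```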
-
Hi, I am currently working on the model you have described. While reviewing the related documentation, I have encountered some questions regarding StageTwo, "Multi-View Knowledge Integration." Specific…
-
## To-do
Write up my study notes for the following sections!
- [x] What is attention?
- [x] What is self-attention?
- [x] 3.2.1: Scaled Dot-Product Attention (see the sketch below)
- [x] 3.2.3: Multi-Head Attention
- [x] 3.2.4: Applications of Attenti…
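Since the list starts with Scaled Dot-Product Attention, here is a minimal NumPy sketch of the formula it covers, Attention(Q, K, V) = softmax(QKᵀ/√d_k)V; this is my own illustration, not part of the notes:

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    d_k = q.shape[-1]
    scores = q @ k.swapaxes(-2, -1) / np.sqrt(d_k)   # query-key similarity
    weights = np.exp(scores - scores.max(-1, keepdims=True))
    weights /= weights.sum(-1, keepdims=True)        # softmax over keys
    return weights @ v

q = k = v = np.random.randn(4, 8)  # (sequence, d_k); self-attention uses Q = K = V
print(scaled_dot_product_attention(q, k, v).shape)   # (4, 8)
```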
-
Hi bytedance,
I was trying to reproduce the Cityscapes evaluation results from the paper (test only, Table 2).
I have done the necessary setup.
When I try to run:
```
python train_net.py \
    --co…
```
-
```
/tmp/tmppngxpwds.obj
Traceback (most recent call last):
  File "/home/jkx/anaconda3/envs/InstantMesh/lib/python3.10/site-packages/gradio/queueing.py", line 536, in process_events
    response =…
```