-
### 🚀 The feature, motivation and pitch
I am unable to find a clean implementation of local multi-head self-attention in PyTorch Geometric. I found three types of multi-head attention, one Transf…
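To pin down what "local" means here, a minimal NumPy sketch may help: each token attends only to neighbors within a fixed window, with several heads computed independently. This is an illustrative toy (the function name, weights, and `window` parameter are hypothetical), not PyTorch Geometric's API.

```python
import numpy as np

def local_multihead_attention(X, W_q, W_k, W_v, num_heads, window):
    """Windowed self-attention: token i attends only to tokens j with |i - j| <= window."""
    n, d = X.shape
    d_h = d // num_heads                       # per-head dimension
    Q = (X @ W_q).reshape(n, num_heads, d_h)
    K = (X @ W_k).reshape(n, num_heads, d_h)
    V = (X @ W_v).reshape(n, num_heads, d_h)
    idx = np.arange(n)
    mask = np.abs(idx[:, None] - idx[None, :]) <= window  # locality mask
    out = np.empty_like(Q)
    for h in range(num_heads):
        scores = Q[:, h] @ K[:, h].T / np.sqrt(d_h)
        scores = np.where(mask, scores, -np.inf)          # block non-local pairs
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        w /= w.sum(axis=-1, keepdims=True)                # softmax over allowed keys
        out[:, h] = w @ V[:, h]
    return out.reshape(n, d)                  # concatenate heads

rng = np.random.default_rng(0)
n, d, H = 10, 16, 4
X = rng.standard_normal((n, d))
W_q, W_k, W_v = (rng.standard_normal((d, d)) * 0.1 for _ in range(3))
Y = local_multihead_attention(X, W_q, W_k, W_v, num_heads=H, window=2)
print(Y.shape)  # (10, 16)
```

In a graph setting the window mask would be replaced by the graph's adjacency, which is presumably what a PyG-native layer would do.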
-
Hello:)
Thank you so much for sharing your code. It has been very useful in understanding the paper.
There is still something I don't quite get from the paper and the code. From my understandin…
-
Why doesn't the architecture need a position embedding?
-
I researched the Transformer algorithm and summarize it here.
References:
- [The Transformer paper](https://arxiv.org/pdf/1706.03762)
- [A Japanese translation of the paper](https://hiroyukichishiro.com/attention-is-all-you-need/)
- [Explanation of the algorithm (1)](https://qiita.…
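The core operation of the paper referenced above is scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ/√d_k)V. A minimal NumPy sketch of that formula (illustrative only, not the paper's code):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """softmax(Q K^T / sqrt(d_k)) V, as in "Attention Is All You Need"."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                 # (n_q, n_k) similarity scores
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)              # softmax over keys
    return w @ V                                    # weighted sum of values

rng = np.random.default_rng(0)
Q = rng.standard_normal((4, 8))   # 4 queries, d_k = 8
K = rng.standard_normal((6, 8))   # 6 keys
V = rng.standard_normal((6, 8))   # 6 values
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (4, 8)
```

The 1/√d_k scaling keeps the dot products from growing with dimension, which would otherwise push the softmax into regions with vanishing gradients.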
-
### Branch/Tag/Commit
main
### Docker Image Version
nvcr.io/nvidia/pytorch:21.04-py3
### GPU name
3090
### CUDA Driver
525.89.02
### Reproduced Steps
```shell
Bert Model with self defined att…
-
### Issue Type
Documentation Bug
### Source
source
### Keras Version
2.14
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 22.04
### Python version
3.10
…
-
~~I am trying to run the test_op pytest on the fused attention tutorial (https://triton-lang.org/master/getting-started/tutorials/06-fused-attention.html) on an A100 with CUDA 11.4. The error is:~~
…
-
When running `sh scripts/run_text2video.sh`, an error occurred.
```
[rank:0] batch-1 (1)x1 ...
Traceback (most recent call last):
File "/media/mil/cc-code/VADER/VideoCrafter/scripts/evalua…
-
Hi, I am currently working on the model you have described. While reviewing the related documentation, I have encountered some questions regarding StageTwo, "Multi-View Knowledge Integration." Specific…
-
### 🚀 The feature, motivation and pitch
In the original implementation of the GPSLayer (found in [graphgps/layer/gps_layer.py](https://github.com/rampasek/GraphGPS/blob/main/graphgps/layer/gps_layer.…