-
We should strive to make exceptions readable and consistent throughout `gempyor`. Some general style guidelines include:
1. Choosing the correct exception for the given issue instead of just defaul…
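A minimal sketch of that first guideline (the function and parameters are illustrative, not actual `gempyor` API): raise the specific built-in exception that matches the failure rather than a generic `Exception`.

```python
from pathlib import Path

def load_seeding_file(path: str, n_subpops: int) -> Path:
    """Validate inputs before loading a (hypothetical) seeding file."""
    if not isinstance(path, str):
        # Wrong type entirely -> TypeError.
        raise TypeError(f"`path` must be a str, got {type(path).__name__}.")
    if n_subpops <= 0:
        # Right type, wrong value -> ValueError, not a bare Exception.
        raise ValueError(f"`n_subpops` must be positive, got {n_subpops}.")
    file = Path(path)
    if not file.exists():
        # Missing file -> FileNotFoundError, which callers can catch precisely.
        raise FileNotFoundError(f"Seeding file not found: {file}.")
    return file
```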
-
# Welcome to JunYoung's blog | On Transformers and Multimodality
Attention mechanism
[https://junia3.github.io/blog/trnmultimodal](https://junia3.github.io/blog/trnmultimodal)
-
### Model description
"Attention Is All You Need" is a landmark 2017 research paper authored by eight scientists working at Google, responsible for expanding 2014 attention mechanisms proposed by Bah…
-
Would you please add the reference for the implementation details of the attention layer?
-
Attention mechanisms are widely used in deep learning models, particularly in large language models, and a flexible attention kernel can help users conveniently build accelerated language models on…
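As one concrete illustration of what a flexible kernel interface looks like (assuming PyTorch ≥ 2.0; this is not the kernel referred to above), `torch.nn.functional.scaled_dot_product_attention` exposes masked and causal attention behind a single call and dispatches to a fused backend when the hardware supports it:

```python
import torch
import torch.nn.functional as F

# Toy shapes: batch=2, heads=4, sequence length=128, head dim=64.
q = torch.randn(2, 4, 128, 64)
k = torch.randn(2, 4, 128, 64)
v = torch.randn(2, 4, 128, 64)

# One call covers causal masking; the backend may pick a fused
# (FlashAttention-style) kernel depending on device and dtype.
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
print(out.shape)  # torch.Size([2, 4, 128, 64])
```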
-
The [Neural Machine Translation (seq2seq) Tutorial](https://github.com/tensorflow/nmt#background-on-the-attention-mechanism) contains a dead link under the **Background on the Attention Mechanism** se…
-
Thank you very much for your great work!
I encountered a problem while reading the source code: what is the role of `num_tokens`?
I found the `num_tokens` parameter in the source code of `IPAttnPr…
-
Thank you for your work.
After reading your paper, I have a question.
In the Feature Split (FS) of Sec. 3.2.2, Efficient Transformer, I am confused about the difference between this FS and window-att…
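For what it's worth, a rough sketch of the generic distinction the question draws (not the paper's actual FS implementation): a feature split divides the *channel* dimension into groups, each still seeing all tokens, while window attention divides the *token* dimension into local windows, each keeping all channels.

```python
import torch

x = torch.randn(2, 64, 196)  # (batch, channels, tokens) -- illustrative shapes only

# Feature-split style: attention would run independently inside each channel group,
# but every group still attends over the full set of tokens.
g = 4
feature_groups = x.reshape(2, g, 64 // g, 196)   # (batch, groups, channels/g, tokens)

# Window-attention style: attention would run independently inside each token window,
# but every window keeps the full channel dimension.
w = 49
token_windows = x.reshape(2, 64, 196 // w, w)    # (batch, channels, windows, window_len)
```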
-
Has anybody reproduced the accuracy of Arjun et al.'s ViT on the DEAP dataset?
In the related paper, "Introducing attention mechanism for EEG signals: Emotion recognition with vision transformer…
-
Hi! I'm trying to use these sparse functions as an alternative to the softmax function in the attention mechanisms of transformers. However, the loss becomes NaN in the first iteration... Do you know …
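In case a self-contained repro helps, here is a minimal sparsemax-style attention sketch (a generic implementation of Martins & Astudillo's sparsemax, not this repository's code). One thing it highlights: a row whose logits are all masked to -inf yields NaN with a sparse projection just as it does with softmax, which is a common source of NaN loss in the very first iteration.

```python
import torch

def sparsemax(logits: torch.Tensor, dim: int = -1) -> torch.Tensor:
    """Sparsemax: Euclidean projection of logits onto the probability simplex."""
    z, _ = torch.sort(logits, dim=dim, descending=True)
    cumsum = z.cumsum(dim)
    k = torch.arange(1, logits.size(dim) + 1, device=logits.device, dtype=logits.dtype)
    view = [1] * logits.dim()
    view[dim] = -1
    k = k.view(view)
    support = 1 + k * z > cumsum                  # sorted entries kept in the support
    k_z = support.sum(dim=dim, keepdim=True)      # support size per row
    idx = (k_z - 1).clamp(min=0)
    tau = (cumsum.gather(dim, idx) - 1) / k_z.to(logits.dtype)  # threshold
    return torch.clamp(logits - tau, min=0)

# Attention weights with sparsemax in place of softmax.
q = torch.randn(2, 4, 16)                         # (batch, queries, d)
kmat = torch.randn(2, 4, 16)
scores = q @ kmat.transpose(-1, -2) / 16 ** 0.5   # (batch, queries, keys)

weights = sparsemax(scores)                       # rows sum to 1, with many exact zeros
print(weights.sum(-1))

# A fully masked row (all -inf logits) produces NaN -- with sparsemax *or* softmax.
masked = scores.clone()
masked[0, 0, :] = float("-inf")
print(sparsemax(masked)[0, 0])                    # tensor of NaNs
```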