-
[https://arxiv.org/pdf/1608.05745v3.pdf](https://arxiv.org/pdf/1608.05745v3.pdf)
> Accuracy and interpretation are two goals of any successful predictive models. Most existing works have to suffer …
-
# speech recognition
- Soltau, Hagen, Hank Liao, and Hasim Sak. "Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition." arXiv preprint arXiv:1610.09975 (201…
-
Hi, I'm trying to implement the Deep Recurrent Attention Model described in the paper http://arxiv.org/pdf/1412.7755v2.pdf and apply it to image caption generation instead of image classification. I will …
-
**Abstract:**
> The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also conn…
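The core building block the abstract refers to is scaled dot-product attention, `softmax(Q K^T / sqrt(d_k)) V`. A minimal NumPy sketch (shapes and the random toy inputs below are illustrative, not from the paper):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # (n_q, n_k) similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ V                               # weighted sum of values

# Toy shapes: 3 queries, 4 keys/values, d_k = 8, d_v = 5
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 5))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (3, 5)
```

Each output row is a convex combination of the value rows, with mixing weights given by query–key similarity; the full model stacks this with multiple heads and learned projections.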
-
This might be a Keras problem, but have you tried serializing some of the layers? I tried the following to save a model that contains `SoftAttention`:
```
from keras.engine import Input, Model
…
```
-
I'm currently writing a recurrent reinforcement library, with LSTMs, linear attention, etc that I would like to add S4 to.
Unfortunately, I find S4D unable to learn in even simple RL tasks (e.g. outp…
-
-
I'm a little confused about what RetNet does in practice. In the formula `Retention(X) = (Q @ K.T * D) @ V`, if the *decay* is 1, the mathematical derivation proving the equivalence between …
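The parallel form of retention quoted above can be sketched directly in NumPy. Here `D[n, m] = γ^(n-m)` for `n ≥ m` and 0 otherwise (a causal mask with exponential decay); the shapes and `gamma` values are illustrative:

```python
import numpy as np

def retention(Q, K, V, gamma):
    """Parallel-form retention: (Q @ K.T * D) @ V, where
    D[n, m] = gamma**(n - m) for n >= m and 0 otherwise."""
    T = Q.shape[0]
    n = np.arange(T)
    D = np.where(n[:, None] >= n[None, :],
                 float(gamma) ** (n[:, None] - n[None, :]), 0.0)
    return (Q @ K.T * D) @ V

rng = np.random.default_rng(0)
T, d = 6, 4
Q, K, V = (rng.normal(size=(T, d)) for _ in range(3))

out_decay = retention(Q, K, V, gamma=0.9)

# With decay gamma = 1, D reduces to the plain causal (lower-triangular)
# mask, so retention coincides with unnormalized causal linear attention.
out_no_decay = retention(Q, K, V, gamma=1.0)
mask = np.tril(np.ones((T, T)))
assert np.allclose(out_no_decay, (Q @ K.T * mask) @ V)
```

This makes the `decay = 1` case concrete: the decay matrix `D` degenerates to a causal mask, which is the setting in which the parallel/recurrent equivalence derivation is usually stated.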
-
Hello, I recently read your code, and I don't quite understand the Attention layer part, mainly the input shape, the shapes of the intermediate layers, and the meaning of each variable. After reading it I still haven't figured out what the MTL task is, i.e., what problem domain the code solves. Could you write some documentation to introduce it? Thank you.
-
Hello Professor!
I have a question about this subsection: "compare the performance against the naive LSTM approach." Is there a specific architecture that I need to compare my target solution with?…