-
I'm having some generation issues with NMT models trained with OpenNMT-py. These include models trained with OpenNMT-py versions from before flash attention existed, as well as one I'm currently training with the most recent version, which includ…
-
### Issue Type
Bug
### Source
binary
### Tensorflow Version
tf 2.10
### Custom Code
Yes
### OS Platform and Distribution
Ubuntu 18.04
### Mobile de…
-
Hello, is the dataset used for seq2seq + attention in the paper multi-round or single-round dialogue?
If it is multi-round, are the multi-round dialogues split into single rounds?
-
When running the a1_seq2seq_attention_train.py file, I ran into the error below. I would appreciate your help.
ValueError: Variable W_initial_state1 already exists, disallowed. Did you mean to set reuse=True in VarScope? Originally defined at:
File "/…
-
Thanks for your awesome contribution. I was wondering whether I can use this to achieve visual attention. I was thinking of using the seq2seq with attention and feeding the convnet's flatten layer as …
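Not an authoritative answer, but the common trick for visual attention is to keep the conv feature map's spatial grid rather than the fully flattened vector, and let the attention decoder treat each spatial position as one encoder "time step". A rough PyTorch sketch with made-up shapes:

```python
import torch

# Hypothetical CNN backbone output: (batch, channels, height, width).
batch, channels, height, width = 2, 512, 7, 7
conv_features = torch.randn(batch, channels, height, width)

# Reshape so each of the H*W positions acts as one encoder step: (batch, 49, 512).
encoder_outputs = conv_features.flatten(2).transpose(1, 2)

# Current decoder hidden state (same size as channels here for a plain dot product).
decoder_state = torch.randn(batch, channels)

# Dot-product attention over the 49 spatial positions.
scores = torch.bmm(encoder_outputs, decoder_state.unsqueeze(2)).squeeze(2)  # (batch, 49)
weights = torch.softmax(scores, dim=1)
context = torch.bmm(weights.unsqueeze(1), encoder_outputs).squeeze(1)       # (batch, 512)
```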
-
Seq2Seq(Attention)\Seq2Seq(Attention)-Tensor.py
The shape of the input should be [max_time, batch_size, ...]. The line `input = tf.transpose(dec_inputs, [1, 0, 2])` has already transposed it. In tf.e…
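For anyone else puzzling over the shapes, a small TF1-style sketch of the layout in question (sizes are made up): the transpose converts the batch-major decoder inputs to the time-major layout the RNN loop expects when it is run with `time_major=True`.

```python
import tensorflow as tf

batch_size, max_time, input_dim = 4, 10, 128

# Batch-major decoder inputs: [batch_size, max_time, input_dim].
dec_inputs = tf.placeholder(tf.float32, [batch_size, max_time, input_dim])

# Time-major layout [max_time, batch_size, input_dim], as required by time_major=True.
dec_inputs_tm = tf.transpose(dec_inputs, [1, 0, 2])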
-
Hi,
I'm running the project from source (master) using Python 3.5, and when I change the model from:
`tf.nn.seq2seq.embedding_rnn_seq2seq`
to
`tf.nn.seq2seq.embedding_attention_seq2seq`
o…
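In case it helps narrow things down, the attention variant is meant to be a near drop-in replacement for the non-attention one in the legacy seq2seq API. A hedged sketch of the call, with made-up vocabulary and cell sizes; the exact module path and keyword set differ between old TF releases (`tf.nn.seq2seq` before 1.0, `tf.contrib.legacy_seq2seq` afterwards):

```python
import tensorflow as tf

# The legacy API takes per-time-step inputs as lists of 1-D int32 tensors.
encoder_inputs = [tf.placeholder(tf.int32, [None]) for _ in range(10)]
decoder_inputs = [tf.placeholder(tf.int32, [None]) for _ in range(10)]

cell = tf.nn.rnn_cell.BasicLSTMCell(256)

outputs, state = tf.nn.seq2seq.embedding_attention_seq2seq(
    encoder_inputs,
    decoder_inputs,
    cell,
    num_encoder_symbols=10000,
    num_decoder_symbols=10000,
    embedding_size=256,
    feed_previous=False)
```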
-
Thanks for sharing! I just noticed that `Attention.get_att_weight` computes the attention weights in a for-loop over the encoder steps; that seems rather slow, doesn't it?
`4-2.Seq2Seq(Attention)/Seq2Seq(Attention).ipynb`
```pyth…
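# (Hedged sketch, not from the repo: one way to replace the per-step loop with a single
#  batched matmul. Tensor names and sizes below are made up; if the score is the usual
#  learned-linear "general" form, it becomes a Linear layer followed by one bmm.)
import torch
import torch.nn as nn

batch, src_len, hidden = 1, 5, 128
enc_outputs = torch.randn(batch, src_len, hidden)   # all encoder hidden states
dec_hidden = torch.randn(batch, hidden)             # current decoder hidden state

attn = nn.Linear(hidden, hidden, bias=False)        # learned score transform

# Scores for every source position at once: (batch, src_len).
scores = torch.bmm(attn(enc_outputs), dec_hidden.unsqueeze(2)).squeeze(2)
weights = torch.softmax(scores, dim=1)

# Context vector: weighted sum of encoder states, (batch, hidden).
context = torch.bmm(weights.unsqueeze(1), enc_outputs).squeeze(1)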
-
Traceback (most recent call last):
  File "train.py", line 139, in <module>
    main()
  File "train.py", line 116, in main
    epoch_loss = train(input_variable, target_variable, encoder, decoder, encode…