-
## 🐛 Bug
The batch size is hardcoded when tracing a model that uses a custom for loop with `nn.LSTMCell`. This makes it impossible to run inference with a different batch size.
## To Reproduce
…
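A minimal sketch of the kind of code that can trigger this (the class name, sizes, and inputs are illustrative, not the original reproduction code): when the initial hidden state is built from `x.size(1)` inside a loop-based `forward`, `torch.jit.trace` may record that batch dimension as a constant taken from the example input.
```python
import torch
import torch.nn as nn

class LoopLSTM(nn.Module):
    def __init__(self, input_size=8, hidden_size=16):
        super().__init__()
        self.hidden_size = hidden_size
        self.cell = nn.LSTMCell(input_size, hidden_size)

    def forward(self, x):  # x: (seq_len, batch, input_size)
        # During tracing, x.size(1) is taken from the example input,
        # so the traced graph may treat the batch size as a constant.
        h = torch.zeros(x.size(1), self.hidden_size)
        c = torch.zeros(x.size(1), self.hidden_size)
        for t in range(x.size(0)):  # the Python loop is unrolled by the tracer
            h, c = self.cell(x[t], (h, c))
        return h

traced = torch.jit.trace(LoopLSTM(), torch.randn(5, 4, 8))  # traced with batch size 4
try:
    traced(torch.randn(5, 2, 8))  # a different batch size may fail or give wrong shapes
except RuntimeError as e:
    print(e)
```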
-
#### Review
**1. Training the model**
- Improved the model with reference to [DeepQA](https://github.com/Conchylicultor/DeepQA) ([trained on a Chinese corpus](https://github.com/qhduan/Seq2Seq_Chatbot_QA)), using the previously mentioned dgk_shooter_min.conv corpus
- Parameters used: lr=0.0003, trained for 5 epochs, b…
-
I don't quite understand the part where you replace max pooling with attention. Could you explain the specific steps of this operation?
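I'm not sure of the repository's exact code, but a common way to make this swap is to replace the max over time steps with a learned weighted sum: score each time step, turn the scores into weights with a softmax, and sum the hidden states with those weights. A minimal sketch (the layer names and sizes here are illustrative, not the repo's implementation):
```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionPooling(nn.Module):
    """Replace max-over-time pooling with a learned weighted sum over time steps."""
    def __init__(self, hidden_size, attn_size=64):
        super().__init__()
        self.proj = nn.Linear(hidden_size, attn_size)
        self.score = nn.Linear(attn_size, 1, bias=False)

    def forward(self, h):                                 # h: (batch, seq_len, hidden_size)
        scores = self.score(torch.tanh(self.proj(h)))     # (batch, seq_len, 1)
        alpha = F.softmax(scores, dim=1)                   # attention weights over time
        return (alpha * h).sum(dim=1)                      # (batch, hidden_size)

h = torch.randn(2, 10, 128)
pooled_max = h.max(dim=1).values          # original max pooling over time
pooled_attn = AttentionPooling(128)(h)    # attention pooling in its place
```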
-
I can only get an F1 score of 0.5760; here is the classification report from the stdout log:
```
precision recall f1-score support
0 0.7017 0.8503 0.7689 1…
-
Hello,
Thanks a lot for providing an easy-to-understand tutorial and attention layer implementation.
I am trying to use attention on a dataset where the input and output lengths differ.
My training dat…
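For reference, encoder-decoder attention by itself does not require the input and output lengths to match: at every output step the decoder scores all encoder states, so only the shape of the attention-weight matrix changes. A minimal sketch with illustrative shapes (not the tutorial's own code):
```python
import torch
import torch.nn.functional as F

# Illustrative shapes only: encoder outputs of length T_in, decoder states of length T_out.
T_in, T_out, d = 12, 7, 32
enc = torch.randn(1, T_in, d)       # encoder outputs
dec = torch.randn(1, T_out, d)      # decoder hidden states

scores = dec @ enc.transpose(1, 2)  # (1, T_out, T_in): one score per encoder step
alpha = F.softmax(scores, dim=-1)   # weights over the input sequence
context = alpha @ enc               # (1, T_out, d): one context vector per output step
```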
-
Hi,
Thanks for your great work.
I have a question: what is the difference between FC and Att2all?
Thanks.
-
## In a nutshell
A study investigating which models are well suited to capturing the logical structure of sentences. The task is inferring relations between sentences, using a dataset deliberately synthesized so that it cannot be solved without capturing those relations. CNNs and Transformers did not produce very good results, while TreeLSTM and the proposed model (which makes its predictions via a latent representation named "World", which seems to represent the structure) performed well.
### Paper link
https://arxiv.o…
-
Hi Zafarali,
I am trying to use your attention network to learn seq2seq machine translation with attention. My source-language vocabulary is of size 32,000 and the target vocabulary size is 34,000. The following…
-
The authors [1] propose "fast weights", a type of attention mechanism over the recent past that performs multiple steps of computation between consecutive hidden-state updates in an RNN. The authors e…
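As a rough illustration (not the authors' code; the sizes, number of inner steps, and hyperparameters are arbitrary, and the layer normalization used in the paper is omitted): the fast weight matrix A decays and accumulates outer products of recent hidden states, and each of the S inner steps re-reads the recent past through A.
```python
import torch

def fast_weights_step(x_t, h_prev, A_prev, W_h, W_x, lam=0.95, eta=0.5, S=3):
    # Decay the fast weights and add the outer product of the latest hidden state.
    A = lam * A_prev + eta * torch.outer(h_prev, h_prev)
    # "Slow" contribution from the ordinary recurrent and input weights.
    slow = W_h @ h_prev + W_x @ x_t
    h_s = torch.relu(slow)
    for _ in range(S):  # inner steps that attend to the recent past via A
        h_s = torch.relu(slow + A @ h_s)
    return h_s, A

hidden_size, input_size = 16, 8
h = torch.zeros(hidden_size)
A = torch.zeros(hidden_size, hidden_size)
W_h = 0.1 * torch.randn(hidden_size, hidden_size)
W_x = 0.1 * torch.randn(hidden_size, input_size)
for x_t in torch.randn(5, input_size):  # run over a short sequence
    h, A = fast_weights_step(x_t, h, A, W_h, W_x)
```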
-
Hi,
I was trying to run `python a3c_main.py --evaluate 2 --load saved/pretrained_model` to run inference with the pre-trained model. However, I got the following dimension error without changing…