attention-lstm Search Results

1000+ results
for attention-lstm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kurtzace/diary-2024 #11

Gen AI - LLM, RAG, Langchain

## [LangChain Development](https://app.pluralsight.com/library/courses/langchain-development/table-of-contents) by [Tom Taulli](https://app.pluralsight.com/profile/author/tom-taulli) founder : H…

kurtzace updated 1 week ago
5
onnx/keras-onnx #293

Error converting Keras model BILSTM with Attention custom la…

Hi there, I have been trying to convert a simple Keras BiLSTM (or LSTM) with Attention model to ONNX. It keeps failing during onnx model save. The error message I am getting is TypeError: ob…

yuvalshachaf updated 4 years ago
2
AkihikoWatanabe/paper_notes #74

A hierarchical neural autoencoder for paragraphs and documen…

https://nlp.stanford.edu/pubs/acl2015_jiwei.pdf

AkihikoWatanabe updated 6 years ago
2
pemami4911/neural-combinatorial-rl-pytorch #5

mask

Hello @pemami4911, The problem really was with the mask. I've fixed it and the network started to learn. My Decoder now is: ``` class Decoder(nn.Module): def __init__(self, feactures_dim,hid…

ricgama updated 6 years ago
35
tensorflow/nmt #254

Include ability to use CudnnLSTM for max GPU utilization

I started making a patch for CudnnLSTM support but in its current state it's quite a mess. However, I can say that step_size dropped from 0.64 to 0.23s in my testing and CPU usage is down from 1100% t…

xtknight updated 6 years ago
3
freesunshine0316/MPQG #9

encoder state in `matching_encoder_utils.py`

I read the paper "Leveraging Context Information for Natural Question Generation". Section 2.2 says: > Each encoder state hj is the concatenation of two bi-directional LSTM states (Section 3.2 …

pjlintw updated 5 years ago
4
nanguoshun/LSR #43

about the bug when training the model

![image](https://user-images.githubusercontent.com/38101748/120875897-c4cc4680-c5e0-11eb-9047-0498e1e9aed2.png when I use train.py , It stopped automatically when step is 31. I changed several parame…

fresh382227905 updated 3 years ago
1
mila-iqia/blocks #1025

LSTM.apply interface. recurrent.states and recurrent.outputs

LSTM.apply use 'states', 'cells' as recurrent states and return them as output. I think returning 'cells' is unnecessary. It makes the LSTM interface different from that of GRU. And LSTM is incompatib…

Beronx86 updated 8 years ago
1
jerrodparker20/adaptive-transformers-in-rl #15

Is this algorithm suitable for off-policy policy?

I just finished reading your paper, and I notice that it is an on policy method. And I wondering if anyone has tested it with an rl method that has a replay_buff pool. As far as I know, for off…

dbsxdbsx updated 4 years ago
1
moneyDboat/data_grand #12

请教个问题

您好，萌新问个问题。大概看了一遍代码，请问划分后的原始数据是否已经做了截断和padding？但是在后面的模型部分，并没有看到对padding（比如Attention的softmax，LSTM的hn）的单独处理，请问是影响不大还是什么？

guofei1989 updated 4 years ago
1

上一页 1...15 16 17 18 19 20 21...100 下一页

1000+ results for attention-lstm

1000+ results
for attention-lstm