-
## [LangChain Development](https://app.pluralsight.com/library/courses/langchain-development/table-of-contents)
by [Tom Taulli](https://app.pluralsight.com/profile/author/tom-taulli)
founder : H…
-
Hi there,
I have been trying to convert a simple Keras BiLSTM (or LSTM) model with attention to ONNX.
It keeps failing when the ONNX model is saved.
The error message I am getting is:
TypeError: ob…
-
https://nlp.stanford.edu/pubs/acl2015_jiwei.pdf
-
Hello @pemami4911,
The problem really was with the mask. I've fixed it and the network started to learn. My Decoder now is:
```
class Decoder(nn.Module):
    def __init__(self, feactures_dim,hid…
```
-
I started making a patch for CudnnLSTM support but in its current state it's quite a mess. However, I can say that step_size dropped from 0.64 to 0.23s in my testing and CPU usage is down from 1100% t…
-
I read the paper "Leveraging Context Information for Natural Question Generation".
Section 2.2 says:
> Each encoder state hj is the concatenation of two bi-directional LSTM states
(Section 3.2 …
-
![image](https://user-images.githubusercontent.com/38101748/120875897-c4cc4680-c5e0-11eb-9047-0498e1e9aed2.png)
When I run train.py, it stops automatically at step 31. I changed several parame…
-
LSTM.apply uses 'states' and 'cells' as recurrent states and returns both as outputs. I think returning 'cells' is unnecessary. It makes the LSTM interface different from that of GRU. And LSTM is incompatib…
-
I just finished reading your paper, and I noticed that it is an on-policy method.
I am wondering whether anyone has tested it with an RL method that uses a replay buffer.
As far as I know, for off…
-
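Hello, a beginner question here.
I skimmed through the code. Has the raw data already been truncated and padded after splitting? In the model code later on, I did not see any separate handling of padding (for example in the Attention softmax or the LSTM's hn). Is the effect negligible, or is there another reason?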
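
To make the question concrete, here is a minimal sketch of what "handling padding in the attention softmax" usually means. It is not taken from this repository; the function name `masked_softmax` and the example scores are hypothetical, and it is written in plain Python so it runs without any framework. The idea is that padded positions get zero attention weight and the real positions are renormalised among themselves.

```python
import math

def masked_softmax(scores, mask):
    """Softmax over `scores`, ignoring positions where `mask` is 0 (padding).

    Hypothetical helper for illustration only, not part of the repository.
    """
    # Use only real (non-padding) positions to find the max for numerical stability.
    valid = [s for s, m in zip(scores, mask) if m]
    if not valid:
        # Every position is padding: return zeros instead of dividing by zero.
        return [0.0] * len(scores)
    top = max(valid)
    # Padded positions contribute exactly 0 to the numerator and the denominator.
    exps = [math.exp(s - top) if m else 0.0 for s, m in zip(scores, mask)]
    total = sum(exps)
    return [e / total for e in exps]

# Made-up attention scores for a sequence of true length 3, padded to length 5.
weights = masked_softmax([2.0, 1.0, 0.5, 9.0, 9.0], [1, 1, 1, 0, 0])
```

Without the mask, the two padded positions (score 9.0 here) would take almost all of the attention weight. For the LSTM's hn, the analogous step in PyTorch is typically `torch.nn.utils.rnn.pack_padded_sequence`, so that the final hidden state comes from the last real token rather than from padding; whether this repository needs it depends on how its data is batched, which I have not verified.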