-
Hello,
I have a problem conducting layer feature attribution for my GRU model.
```
from captum.attr import LayerIntegratedGradients
ig = LayerIntegratedGradients(foreward_func, model.encode…
```
-
Hi there,
It looks like in the latest version of TF (1.0?), `tf.nn.rnn_cell.LSTMCell()` has been replaced with `tf.contrib.rnn.LSTMCell()`. I upgraded TF today and came across this:
[https:/…
-
http://arxiv.org/pdf/1512.05287v1.pdf
Yarin Gal has a paper describing an LSTM that applies the same ("fixed") dropout mask at every time step to drop LSTM units. It would be cool to have an implementation of this in theanets.
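
To make the request concrete, here is a minimal sketch of the "fixed mask" idea in plain Python: one mask is sampled per sequence and reused at every time step, instead of being resampled at each step. It is not theanets code; the names `sample_mask` and `apply_fixed_mask` and the toy data are assumptions made for illustration, and the paper applies such masks inside the LSTM (inputs, outputs, and recurrent connections) rather than to a finished list of hidden vectors.

```python
import random


def sample_mask(size, drop_prob, rng):
    """Sample one inverted-dropout mask: 0.0 for dropped units, 1/(1-p) for kept ones."""
    keep_scale = 1.0 / (1.0 - drop_prob)
    return [0.0 if rng.random() < drop_prob else keep_scale for _ in range(size)]


def apply_fixed_mask(hidden_states, mask):
    """Apply the SAME mask to the hidden vector of every time step."""
    return [[h * m for h, m in zip(step, mask)] for step in hidden_states]


rng = random.Random(0)  # seeded only so the sketch is repeatable
# Toy data (hypothetical): 5 time steps, 4 hidden units each.
hidden_states = [[1.0, 2.0, 3.0, 4.0] for _ in range(5)]

mask = sample_mask(size=4, drop_prob=0.5, rng=rng)  # sampled once per sequence
dropped = apply_fixed_mask(hidden_states, mask)      # reused at every step
```

Standard dropout would call `sample_mask` again at each time step; the only change here is that the mask is drawn once and shared across the whole sequence.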
-
Thanks so much for your code, hunkim! It is very helpful!
Can I ask a quick question please? Am I right to think: within one batch, every time you feed `model.initial_state: state` to `model`, it over…
-
What is the reason for not incorporating/benchmarking `BackwardWeights` at least for NVIDIA? There is no use of `cudnnRNNBackwardWeights`.
-
There is a lot of interest in having support for RNN loops in TC. Creating this master task to discuss further and track progress.
-
Dear TA,
I got the runtime error below.
RuntimeError: Expected hidden size (1, 1L, 512), got (1L, 50L, 512L)
**Below is my source code; I confirmed that the final rnn_input is 1 x batch x input_size**…
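
For reference, a small sketch of the shape rule behind this kind of message, assuming PyTorch-style conventions: the initial hidden state is `(num_layers * num_directions, batch, hidden_size)`, and the batch size is read from dimension 1 of the input unless `batch_first=True`, in which case it is read from dimension 0. The helper name `expected_hidden_shape` and the input size of 256 are hypothetical; since the source code above is truncated, this only shows how a mismatch such as `(1, 1, 512)` versus `(1, 50, 512)` can arise, not what the actual cause is.

```python
def expected_hidden_shape(input_shape, hidden_size, num_layers=1,
                          bidirectional=False, batch_first=False):
    """Return the (num_layers * num_directions, batch, hidden_size) shape
    that the initial hidden state should have for a 3-D RNN input."""
    if len(input_shape) != 3:
        raise ValueError("expected a 3-D input shape")
    # The batch dimension is index 0 with batch_first=True, otherwise index 1.
    batch = input_shape[0] if batch_first else input_shape[1]
    num_directions = 2 if bidirectional else 1
    return (num_layers * num_directions, batch, hidden_size)


# Input laid out as (seq_len=1, batch=50, input_size=256): hidden state needs batch 50.
print(expected_hidden_shape((1, 50, 256), hidden_size=512))
# → (1, 50, 512)

# The same shape read with batch_first=True is treated as batch 1, sequence length 50.
print(expected_hidden_shape((1, 50, 256), hidden_size=512, batch_first=True))
# → (1, 1, 512)
```

Comparing the shape of the hidden state that is passed in against this expected shape is a quick way to see which dimension the RNN is treating as the batch.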
-
Hello,
Thanks for open sourcing the code.
After your commit:
https://github.com/melodyguan/enas/commit/2734eb2657847f090e1bc5c51c2b9cbf0be51887
I get a perplexity of 63.26 rather than the 55.6 stated in the …
-
I want to run an RNN (https://fluxml.ai/Flux.jl/stable/models/recurrence/) on the GPU, using explicit (https://fluxml.ai/Flux.jl/stable/training/training/#Implicit-or-Explicit?) gradients.
This …
-
Not sure if this RNN counts as an LLM, but if so, it would be nice to have it. Let me know what needs to be done with packaging.
https://www.rwkv.com/