-
tflearn.dropout(network, 0.5)
In this line of code, does the 0.5 represent the probability that a neuron will be dropped or is it the probability that a neuron will not be dropped?
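The two readings of that argument can be compared with a small sketch. It assumes, from recollection of the TFLearn docs, that the second argument of `tflearn.dropout` is named `keep_prob` (probability of keeping a unit), whereas `torch.nn.Dropout(p)` and Keras `Dropout(rate)` take the probability of dropping one; this is worth confirming against the installed version. The helpers `dropout` and `kept_fraction` below are hypothetical, written in plain Python only to illustrate the difference, and are not part of any library.

```python
import random


def dropout(values, rate, convention="keep", rng=None):
    """Inverted dropout over a list of floats (illustrative helper, not a library API).

    convention="keep": `rate` is the probability that an element survives.
    convention="drop": `rate` is the probability that an element is zeroed.
    Surviving elements are scaled by 1 / keep_prob so the expected sum is unchanged.
    """
    if convention not in ("keep", "drop"):
        raise ValueError("convention must be 'keep' or 'drop'")
    keep_prob = rate if convention == "keep" else 1.0 - rate
    if not 0.0 < keep_prob <= 1.0:
        raise ValueError("resulting keep probability must be in (0, 1]")
    rng = rng or random.Random()
    return [v / keep_prob if rng.random() < keep_prob else 0.0 for v in values]


def kept_fraction(rate, convention, n=100_000, seed=0):
    """Empirical fraction of units that survive dropout for a given convention."""
    out = dropout([1.0] * n, rate, convention, random.Random(seed))
    return sum(1 for v in out if v != 0.0) / n


if __name__ == "__main__":
    for rate in (0.5, 0.8):
        print(rate, kept_fraction(rate, "keep"), kept_fraction(rate, "drop"))
```

At 0.5 both conventions keep about half of the units, so the value in the question behaves the same either way; the distinction only matters for other values such as 0.8.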
-
COMMAND: python main.py --encoder cnn --decoder rnn --encoder-dropout 0.05 --decoder-dropout 0.2
Namespace(batch_size=1, cuda=True, decoder='rnn', decoder_dropout=0.2, decoder_hidden=256, dims=6, …
-
What's the reasoning behind the extra dropout layer after projection?
Karpathy's implementation has 2 dropout layers:
1. `attn_dropout`
2. `resid_dropout`
Karpathy's 2nd dropout layer
https…
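For reference, here is a minimal PyTorch sketch of where the two layers sit in a minGPT-style causal self-attention module. The names `attn_dropout` and `resid_dropout` come from the question; the remaining names, default sizes, and dropout rates are assumptions chosen for illustration, and this is not a copy of Karpathy's code.

```python
import math

import torch
import torch.nn as nn
import torch.nn.functional as F


class CausalSelfAttention(nn.Module):
    """Illustrative attention block showing the placement of both dropout layers."""

    def __init__(self, n_embd=32, n_head=4, block_size=16,
                 attn_pdrop=0.1, resid_pdrop=0.1):
        super().__init__()
        assert n_embd % n_head == 0
        self.n_head = n_head
        self.c_attn = nn.Linear(n_embd, 3 * n_embd)  # joint q, k, v projection
        self.c_proj = nn.Linear(n_embd, n_embd)      # output projection
        self.attn_dropout = nn.Dropout(attn_pdrop)   # 1st: on attention weights
        self.resid_dropout = nn.Dropout(resid_pdrop) # 2nd: on the projected output
        mask = torch.tril(torch.ones(block_size, block_size))
        self.register_buffer("mask", mask.view(1, 1, block_size, block_size))

    def forward(self, x):
        B, T, C = x.size()
        q, k, v = self.c_attn(x).split(C, dim=2)
        # Reshape each to (B, n_head, T, head_dim).
        q = q.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        k = k.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)
        v = v.view(B, T, self.n_head, C // self.n_head).transpose(1, 2)

        att = (q @ k.transpose(-2, -1)) / math.sqrt(k.size(-1))
        att = att.masked_fill(self.mask[:, :, :T, :T] == 0, float("-inf"))
        att = F.softmax(att, dim=-1)
        att = self.attn_dropout(att)  # randomly zeroes individual attention weights

        y = att @ v
        y = y.transpose(1, 2).contiguous().view(B, T, C)
        # Randomly zeroes features of the sub-layer output, which the enclosing
        # block then adds to the residual stream.
        return self.resid_dropout(self.c_proj(y))
```

The first layer acts on the `(T, T)` attention pattern, while the second acts on the embedding-sized output after projection, so they regularize different quantities. Both are inactive after `model.eval()`.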
-
I searched for "dropout" in the project and only found it in the LSTM model.
I think adding some structure to prevent overfitting is necessary.
When training the cross_encoder model, I find this has a negative e…
-
### 🐛 Describe the bug
https://github.com/pytorch/pytorch/pull/100064 caused 3% perf drop and 4% memory drop in HF training because of disabled low memory dropout
### Versions
master
cc @ezy…
-
Hello there,
**Is your documentation request related to a problem? Please describe.**
I've been having difficulty in wrapping my pytorch forecasting models to use with your library. While I hav…
-
### 🚀 The feature, motivation and pitch
It's possible that such an inplace fusion is already possible with dynamo, but if not, it's quite good to have for saving memory. Also maybe worth introducin…
-
In the example for [time series classification with transformer](https://keras.io/examples/timeseries/timeseries_classification_transformer/), the function `build_model()` is defined as:
```
…
-
I am encountering an index error when running [this](https://flax.readthedocs.io/en/v0.8.3/experimental/nnx/nnx_basics.html#:~:text=from%20functools%20import%20partial) example in the documentation
I…
-
Thank you very much for sharing the whole implementation! I am curious about the `dropout step` in this figure. May I ask some questions?
![image](https://user-images.githubusercontent.com/14788650/5874…
luzai updated
2 years ago