-
I'm implementing my own RL framework in JAX to better understand RL algorithms, and I found your code very helpful.
Looking at the NoisyNets implementation, on lines 316 and 317 (https://github.com/goog…
-
Currently, requires_grad means two things:
1) That we should compute gradients for this variable and for functions of this variable
2) On a "leaf" variable, that we should store the gradient to the…
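A small PyTorch example of the two meanings (the tensor names are only for illustration):
```
import torch

# (1) requires_grad on a leaf tensor asks autograd to track every
#     operation that uses it, and therefore every function of it.
x = torch.randn(3, requires_grad=True)

z = x * 2        # non-leaf: produced by an operation on x
out = z.sum()
out.backward()

# (2) Only the leaf keeps its gradient in .grad; a non-leaf drops its
#     gradient unless z.retain_grad() is called before backward().
print(x.is_leaf, x.grad)   # True  tensor([2., 2., 2.])
print(z.is_leaf, z.grad)   # False None
```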
-
Hi @Kismuz,
I was reading the paper "Noisy Networks for Exploration" and have a question about its usage in btgym. The paper says that "As A3C is an on-policy algorithm the gradients are unbiased w…
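For context, my (possibly wrong) reading of that section is that the noise should be drawn once per roll-out and then held fixed until the update. A rough sketch of that pattern, where `reset_noise()` and `act()` are hypothetical helpers rather than names from btgym, and `env` is a classic gym-style environment:
```
def collect_rollout(policy, env, rollout_len=20):
    # Draw one noise sample for the whole roll-out, so every action and
    # the subsequent gradient computation use the same noisy weights.
    policy.reset_noise()

    transitions = []
    obs = env.reset()
    for _ in range(rollout_len):
        action = policy.act(obs)   # reuses the frozen noise sample
        next_obs, reward, done, _ = env.step(action)
        transitions.append((obs, action, reward, done))
        obs = next_obs
        if done:
            break
    # ...compute the A3C update from `transitions` with the same noise,
    # then call reset_noise() again before the next roll-out.
    return transitions
```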
-
I'm running the following on 4 GPUs:
```
import torch
import torch.nn as nn
from torchvision.models import resnet50

model = resnet50()   # assuming the standard torchvision ResNet-50
model = model.cuda()
criterion = nn.CrossEntropyLoss(reduction='mean').cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.001)
…
```
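The snippet above is cut off, so it doesn't show how the four GPUs are actually used. For reference, a minimal (assumed) way to spread the forward pass over all visible GPUs is `nn.DataParallel`; this reuses the imports and objects from the snippet above:
```
# Sketch only: wrap the model so each forward pass splits the batch
# across all visible GPUs (here, the 4 GPUs).
model = nn.DataParallel(model)

# The training step itself is unchanged; `images`/`targets` stand in for
# a batch from whatever data loader is being used.
# outputs = model(images.cuda())
# loss = criterion(outputs, targets.cuda())
# optimizer.zero_grad(); loss.backward(); optimizer.step()
```
For serious multi-GPU training, `torch.nn.parallel.DistributedDataParallel` with one process per GPU is usually preferred over `DataParallel`.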
-
![image](https://user-images.githubusercontent.com/23333028/48664090-cf836680-eadc-11e8-969b-5201db99907d.png)
-
Implement the best practices from the multi-agent RL community and Stable-Baselines3 into our algorithm. Further, analyse the similarities between the PettingZoo multi-agent implementation and the current RL implementa…
-
Hello,
I see one commit (6c8b281) that tries to fix the default value of the stddev in the Noisy layer, but I think it is overridden anyway by the default value in args.py, which is 0.1.
Moreover the…
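To make the interaction concrete, here is a rough sketch; `NoisyLinear`, `std_init`, and `--noisy-std` are illustrative names, not necessarily the ones used in the repo. Even if the layer's own default is the paper's 0.5, the argparse default of 0.1 is always passed in and therefore wins:
```
import argparse
import math
import torch
import torch.nn as nn

class NoisyLinear(nn.Module):
    """Sketch of a factorised-Gaussian noisy layer; std_init defaults to
    the paper's sigma_0 = 0.5."""
    def __init__(self, in_features, out_features, std_init=0.5):
        super().__init__()
        bound = 1.0 / math.sqrt(in_features)
        self.weight_mu = nn.Parameter(
            torch.empty(out_features, in_features).uniform_(-bound, bound))
        self.weight_sigma = nn.Parameter(
            torch.full((out_features, in_features),
                       std_init / math.sqrt(in_features)))
        # bias parameters and noise buffers omitted for brevity

# args.py-style default: whatever is passed here reaches the layer, so
# the layer's own default of 0.5 never takes effect.
parser = argparse.ArgumentParser()
parser.add_argument('--noisy-std', type=float, default=0.1)
args = parser.parse_args([])

layer = NoisyLinear(64, 64, std_init=args.noisy_std)   # 0.1 wins
```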
-
I've been working with the code in different environments like Pong and other NES games, but almost all the time I see the same pattern: the loss goes down normally, but after some point it jumps to a v…
-
After changing the environment to `Assault`, the algorithm no longer works, unlike other implementations. Are there any plans to support other Atari environments?
-
Fortunato, Meire; Azar, Mohammad Gheshlaghi; Piot, Bilal; Menick, Jacob; Osband, Ian; Graves, Alex; Mnih, Vlad; Munos, Remi; Hassabis, Demis. "Noisy Networks for Exploration."
http://arxiv.org/abs/1706.10295