-
Thank you for the nice implementation. I'm curious about the running time on your machine. In https://github.com/ikostrikov/pytorch-a3c, it is reported that PongDeterministic-v3 is solved around 15min…
-
Could we put somewhere a list of projects that are using GPflow?
I am curious what people are doing with GPflow and it would help to advertise other projects building on GPflow.
If you think this …
-
I've been looking into using learning rate decay for RL, and I realized the method in #680 won't work for it. That method relies on first creating the TensorGraph so you can get a reference to its gl…
-
Hi Jingweiz,
this a great implementation of the `DNC`, thanks a lot for sharing it. I was wondering what's the recommended/simplest way to add new environments, (from `OpenAI`)? I guess it would ju…
ghost updated
7 years ago
-
As far as I can see, model hyperparameters are different.
Thanks.
-
Hi, Pong is a good sanity check. Has anyone tried/adopted the code (A3C-LSTM) on other Atari games like BreakoutDeterministic-v3 and SpaceInvadersDeterministic-v3, and managed to get average scores 50…
-
Thank you for this great repository.
I am implementing
[Curiosity-driven Exploration by Self-supervised Prediction](https://pathak22.github.io/noreward-rl/resources/icml17.pdf)
with pytorch, and th…
-
Dear Kim,
thank you very much for sharing your implementation - I like it a lot :+1:
I'm trying to adapt the code to a parallel implementation to reproduce the Atari A3C experiments from the P…
ghost updated
7 years ago
-
I'm referencing this issue:
https://github.com/tensorflow/tensorflow/issues/6360#issuecomment-271486091
-
[Layer normalization](https://arxiv.org/abs/1607.06450) seems to be pretty popular for RNNs nowadays, and it is worth having an implementation available. Several people seem to have already rolled the…