-
After I upgraded to PyTorch 0.2,
the example code in `405_DQN_Reinforcement_learning.py` is broken.
This is because `torch.max()` changed its return value.
So the code needs to change to run in PyTorch …
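For illustration, here is a minimal sketch of the behaviour in question, assuming a recent PyTorch release. The tensor `q_values` and its contents are hypothetical and are not taken from `405_DQN_Reinforcement_learning.py`. With a `dim` argument, `torch.max()` returns a `(values, indices)` pair, and since 0.2 the reduced dimension is dropped unless `keepdim=True` is passed, so indexing written for the old 2-D result may no longer work.

```python
import torch

# Hypothetical Q-values for a single state with three actions, shape (1, 3).
q_values = torch.tensor([[0.1, 0.9, 0.3]])

# With a dim argument, torch.max returns (values, indices).
# keepdim defaults to False, so the reduced dimension is dropped: shape (1,).
values, indices = torch.max(q_values, 1)
action = int(indices[0])

# Passing keepdim=True keeps the reduced dimension: shape (1, 1).
values_kd, indices_kd = torch.max(q_values, 1, keepdim=True)
action_kd = int(indices_kd[0, 0])
```

Code that indexed the result as a 2-D array (for example with `[0, 0]`) would probably need either the 1-D indexing shown first or an explicit `keepdim=True`.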
-
The link https://goo.gl/uGOksc, which refers to the document 'Train neural nets to play video games', redirects to a nonexistent notebook https://github.com/pytorch/tutorials/blob/master/Reinforcement%20(Q-…
-
Training Breakout goes OK, but memory usage exceeds 25 GB after 4 hours of training on 16 CPU cores.
I wonder if it's related to sharing memory between processes.
I run Python 3.5 on scientific lin…
-
As you know, the portfolio weights appear to be static on the test set.
Why didn't the model learn much?
I suspect several reasons.
1. There is no ensemble in the DDPG network
2. The input doesn't include pr…
-
I tested the RL code, but the actor-critic example failed. Any comments?
I have the latest version of PyTorch installed.
![image](https://cloud.githubusercontent.com/assets/5799436/25810029/69f6f046-3441-11e7-9ce3-2…
-
In examples/train_model.py there is the option to run validation every n seconds during training. However, the model agent which observes the teacher's act containing the validation data still has data…
-
My mind has been on Futhark lately, so I thought it was time to open this issue to track the state of the eventual CUDA backend.
I've long been thinking about making a backend for Futh…
-
I tried to evaluate your trained model; however, it seems to have no effect:
```
2017-08-01 21:08:13,757 : reward sum: -21.0, reward mean: -21.0000
[2017-08-01 21:08:13,757] reward sum: -21.0, reward mean: …
-
Dear Kim,
thank you very much for sharing your implementation - I like it a lot :+1:
I'm trying to adapt the code to a parallel implementation to reproduce the Atari A3C experiments from the P…
-
Hi, I get a `free(): invalid pointer` error when running both the notebook and reinforcement_q_learning.py from the PyTorch tutorials (the kernel dies and restarts, but it seems to be a gym issue (see [here](https://github…