pytorch-reinforcement-learning Search Results

724 results
for pytorch-reinforcement-learning

MorvanZhou/PyTorch-Tutorial #9

pytorch 0.2 max method changed

After I upgraded to PyTorch 0.2, the example code in `405_DQN_Reinforcement_learning.py` broke. This is because `torch.max()` changed its return value, so the code needs to change to run in pytorch …

SSARCandy updated 7 years ago (1 comment)

pytorch/tutorials #100

invalid shortlink target

The link https://goo.gl/uGOksc referring to a document 'Train neural nets to play video games' redirects to a non-existing notebook https://github.com/pytorch/tutorials/blob/master/Reinforcement%20(Q-…

andreh7 updated 7 years ago (3 comments)

ikostrikov/pytorch-a3c #11

Possible memory leak?

Training Breakout goes OK, but memory usage exceeds 25 GB after 4 hours of training on 16 CPU cores. I wonder if it's related to sharing memory between processes. I run Python 3.5 on scientific lin…

scientist1642 updated 7 years ago (14 comments)

wassname/rl-portfolio-management #2

Do you have any idea why model didn't learn much?

As you know, the portfolio weights appear to be static on the test set. Why didn't the model learn much? I suspect several reasons: 1. there is no ensemble in the DDPG network 2. the input doesn't include pr…

jd0713 updated 7 years ago (5 comments)

pytorch/examples #159

actor critic example failed

I tested the RL examples, but the actor-critic one failed. Any comments? I have the latest version of PyTorch installed. ![image](https://cloud.githubusercontent.com/assets/5799436/25810029/69f6f046-3441-11e7-9ce3-2…

tigerneil updated 7 years ago (2 comments)

facebookresearch/ParlAI #216

Validation agent has datatype train

In examples/train_model.py there is the option to run validation every n seconds during training. However the model agent which observes the teacher's act containing the validation data still has data…

Henry-E updated 7 years ago (9 comments)

diku-dk/futhark #434

Cuda backend

My mind has been on Futhark lately so I thought it would be time to open this issue in order to track the state of the eventual Cuda backend. I've long been thinking about making a backend for Futh…

mrakgr updated 6 years ago (6 comments)

dgriff777/rl_a3c_pytorch #5

I just want to say your trained model has no effect

I tried to evaluate your trained model, but the result shows no effect: ``` 2017-08-01 21:08:13,757 : reward sum: -21.0, reward mean: -21.0000 [2017-08-01 21:08:13,757] reward sum: -21.0, reward mean: …

lucasjinreal updated 7 years ago (20 comments)

kimhc6028/pathnet-pytorch #1

Help understanding the code?

Dear Kim, thank you very much for sharing your implementation - I like it a lot :+1: I'm trying to adapt the code to a parallel implementation to reproduce the Atari A3C experiments from the P…

AjayTalati updated 7 years ago (14 comments)

openai/gym #567

`python': free(): invalid pointer: Aborted (core dumped) Mak…

Hi, I get a free(): invalid pointer error when running both the notebook and reinforcement_q_learning.py from the PyTorch tutorials (the kernel dies and restarts), but it seems to be a gym issue (see [here](https://github…

NataliaDiaz updated 7 years ago (1 comment)
