-
I tried two different tune runs: with future data (you can find this code by searching for "#Future data") and without. I expected to see a big difference in rewards between these two runs (or at least some differen…
-
Hi there
I'm trying to use MuJoCo on a cluster (Compute Canada, if it makes any difference) and need to compile the Python bindings for MuJoCo 2.2.0. Amazingly, they don't have the precompiled b…
-
I'm trying to start [notebook](https://colab.research.google.com/drive/1aQmH526cjCcB1JJZph2cywCBAjNiQgzW?usp=sharing) from [this article](https://medium.com/mlearning-ai/hyperparameter-optimization-us…
-
**I do not understand how adding entropy to loss will encourage exploration**
I understand that entropy is a measure of unpredictability, or a measure of randomness:
H(X) = -Sum P(x) log(P(x))
…
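To see how the entropy term encourages exploration, here is a minimal NumPy sketch (all names here — `entropy`, `beta`, `policy_loss` — are illustrative, not from any specific library): a uniform policy has maximal entropy, a near-deterministic one has low entropy, so *subtracting* an entropy bonus from the loss rewards the optimizer for keeping the action distribution spread out.

```python
import numpy as np

def entropy(probs):
    """Shannon entropy H(X) = -sum P(x) log P(x) of a categorical distribution."""
    probs = np.asarray(probs, dtype=float)
    return -np.sum(probs * np.log(probs + 1e-12))  # epsilon guards log(0)

# A uniform policy (maximally random) has the highest possible entropy,
# a peaked (near-deterministic) policy has entropy close to zero.
uniform = entropy([0.25, 0.25, 0.25, 0.25])   # = log(4) ≈ 1.386
peaked = entropy([0.97, 0.01, 0.01, 0.01])

# Hypothetical total loss: subtracting beta * H(pi) means that, all else
# equal, a higher-entropy (more exploratory) policy yields a lower loss,
# so gradient descent resists collapsing onto one action too early.
beta = 0.01                # entropy coefficient (a tunable hyperparameter)
policy_loss = 1.0          # placeholder for the usual policy-gradient loss
total_loss = policy_loss - beta * uniform
```

The coefficient `beta` trades off exploration against exploitation; as it decays toward zero, the entropy pressure disappears and the policy is free to become deterministic.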
-
### What is the problem?
The recently added torch implementation of PPO #6826 is over 5X slower when training on Atari (Breakout) and also ends up slowly consuming all the system RAM (perf/ram_…
-
## 🐛 Bug
It seems like there is a very small memory leak during forward and backward propagation through the network that can lead to memory exhaustion after many hours of training.
I stumbled upon this…
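For narrowing down a slow leak like this, one option is the standard-library `tracemalloc` module: take a snapshot before and after a batch of training iterations and diff them to see which source lines accumulated allocations. This is a generic sketch (the `leaky` list just stands in for whatever the suspected training code is), not the reproduction from the report:

```python
import tracemalloc

tracemalloc.start()

snapshot1 = tracemalloc.take_snapshot()

# ... run a batch of training iterations here; the list below is a
# stand-in that deliberately retains ~1 MB so the diff has something to show.
leaky = [bytearray(1024) for _ in range(1000)]

snapshot2 = tracemalloc.take_snapshot()

# Compare snapshots grouped by source line: lines with a large positive
# size_diff are the ones that accumulated memory between the snapshots.
for stat in snapshot2.compare_to(snapshot1, "lineno")[:5]:
    print(stat)
```

Running this repeatedly across training iterations shows whether the same line keeps growing (a genuine leak) or whether the memory is a one-time allocation.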
-
### What is the problem?
When I restore the trained model, the error output is as below:
```
File "/Library/Frameworks/Python.framework/Versions/3.8/lib/python3.8/site-packages/ray/tune/trai…
-
OS: Windows 10
Tensorflow version: 2.6.0 (tensorflow-gpu)
Python version: 3.9.7
Ray version: 1.6.0 (installed from pip)
The A3C agent gives an error on train() when I try to train a multi-agent toy exampl…
-
### What is the problem?
I found that multiple PPO runs still have different performance even when we set the same seed.
How can we obtain exactly the same result with the same seed?
Current strate…
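As a baseline, it helps to confirm that the framework-independent RNGs are actually seeded; here is a minimal sketch (the helper name `seed_everything` is my own, not an RLlib API). Even with this, a full RLlib run can still diverge across repeats because of worker scheduling and GPU nondeterminism, so seeding alone is necessary but not always sufficient:

```python
import random

import numpy as np

def seed_everything(seed):
    """Seed the Python and NumPy RNGs.

    In a full RLlib setup you would additionally pass the seed through the
    trainer config and seed torch and the environment; asynchronous workers
    and GPU kernels may still introduce nondeterminism on top of that.
    """
    random.seed(seed)
    np.random.seed(seed)

seed_everything(42)
run_a = np.random.rand(3)

seed_everything(42)
run_b = np.random.rand(3)

# Both draws started from the same seed, so the sequences match exactly.
```

If two runs differ even after this kind of seeding, the divergence is coming from somewhere outside these RNGs (e.g. worker timing or hardware-level nondeterminism), which narrows the search considerably.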
-
Hey, you guys!
I am really impressed by your concise code for implementing distributed RL algorithms like A3C.
And I am very interested in whether this framework supports training with multiple …