-
Did anyone managed to get the A3C LSTM of this repo to work for Pong (using the openai gym)?
I have already tried several different optimizers, learning rates, network architectures, but still no …
-
I've got this error,
The kernel appears to have died. It will restart automatically.
Could you help me out??
-
I need to get a copy of `shared` neural network of type `torch::nn::Sequential`. It seems that there is no available API for this purpose at the moment. It seems that declaring and instantiating the n…
-
After value, logit, (hx, cx) = model((Variable(state.unsqueeze(0)),(hx, cx))) in train.py, the program doesn't go on. Do you have any idea?
-
This thread is used for sharing experiment results. I'd appreciate if you could write your experiment result to this thread when you try my code. The following messages are sample reports.
-
### What happened + What you expected to happen
Hi, I am using a self-play scheme on SImple_tag_v2 of Pettingzoo, that works on a previous installation of ray_300_dev0 and al old ray 1.2.0 (with modi…
-
你好,非常感谢您能分享代码,但是我们在训练结果性能较低,请问可以提供一下您训练好的模型吗?想做进一步的测试,非常感谢。
-
Hi @dgriff777 . Thank you for your repo. It's great that it can achieve such a high score. But I met a problem when I try to apply it to MsPacman-v0.
I simply used this command `python main.py --e…
-
### What happened + What you expected to happen
The error appears after about 100k episodes with APPO, PPO, A3C, Impala and maybe other algorithms, the RAM and GPU resources are used only about 5…
-
## 🚀 Feature
The RL implementations added do not have the num_workers option. I have a feeling this is because the code doesn't support a shared replay buffer.
### Motivation
Adding this would e…