-
- [x] I am on the [latest](https://github.com/sdispater/pendulum/releases/latest) Pendulum version.
- [x] I have searched the [issues](https://github.com/sdispater/pendulum/issues) of this re…
-
Hi, I used your code to solve another continuous control task in openai/gym, Pendulum-v0. However, the results were quite bad. I didn't use the rllab environment, just the plain gym with som…
-
Thanks for the nice code. I am trying to reproduce the result on "Pendulum-v0" using a3c_cont.py, but the model seems to fail to converge. I have tried various methods, such as experience replay, but still n…
-
## Motivation
Please outline the motivation for the proposal.
Is your feature request related to a problem? e.g., "I'm always frustrated when [...]".
If this is related to another issue, please l…
-
Hello, why does DDPG work so poorly on InvertedPendulum-v4? Do you have any suggestions?
After 200 rounds of training, the agent still kept falling over during the final demonstration.
-
I ran ppo1 on Pendulum-v0.
However, it does not work: training does not converge.
Does anyone have a solution?
-
Hi team,
Thanks for sharing the great work! I have tried reproducing the PendulumSwingup experiments, both continuous and discontinuous. I used only the scripts and code you provided, without any mod…
-
- [x] I am on the [latest](https://github.com/sdispater/pendulum/releases/latest) Pendulum version.
- [x] I have searched the [issues](https://github.com/sdispater/pendulum/issues) of this re…
-
# [Reinforcement Learning] Controlling the Pendulum-v0 Environment with REINFORCE - 재야의 숨은 초보
[https://hiddenbeginner.github.io/rl/2022/10/24/Pendulum_with_REINFORCE.html](https://hiddenbeginner.githu…
-
Hi @nslyubaykin
LSTM+PPO does not converge in the Pendulum-v0 environment. I don't know whether there is a configuration error in my code; could you take a look?
The reward curve is shown below:
![image](htt…