-
I'm trying to use the PPO for the lunar lander but I can't find examples and my code doesn't seem to converge, can you spot the issue? some parameter is wrong?
alg = PPO
```
from mushroom_rl.policy…
-
Currently, only a single step (s, a, r, s') is considered for training DDQN.
There is empirical study that multi-step training performs better: https://rayyoh.github.io/files/2017-Rainbow.pdf
Let'…
-
I want to implement a dueling double DQN algorithm for selecting multiple discrete actions. Since the existing dueling_ddqn_torch.py code is for choosing a single action, I should modify it. But when …
-
Hi! Thank you for your answering!
I know what you mean,but my problem is that I modify model file names there like
if model_name == 'DDQN':
agent.model.save('saved_models/DDQN_ep' + str(e) + '.h5')…
-
The main goal of Deep Learning is to maximize the accumulate reward. In the Q-Learn we use the accumulate reward to update the Qtable. However, the DDQN use the instant reward instead of accumulated r…
-
## Objective
After discrete reinforce method of Reinforcement learning algorithm has been implemented. The next task is to make a blog about reinforce method. This issue is to work on that
## Tas…
-
Hi,
I am trying to figure out ETEO algorithm for OE. However, could you please provide the source paper of ETEO algorithm, an arXiv link should be helpful.
Also, could you please help to explai…
-
I have been spending quite some time reading the codes here and I have been learning quite a lot so far. I got a small question when I backtrack some codes to find out why I got some unstable agents.
…
-
Hi! Thank you for your last answer!
Recently I try to train the DDQN in your project,so I write" if model_name == 'DDQN':
…
-
examples下的DQN_variant 使用DDQN报错
readme推荐环境是
+ [paddlepaddle>=2.0.0](https://github.com/PaddlePaddle/Paddle)
+ [parl>=2.0.1](https://github.com/PaddlePaddle/PARL)
+ gym==0.18.0
+ tqdm
+ atari-py==…