ddqn Search Results - Githubissues

277 results
for ddqn

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

MushroomRL/mushroom-rl #124

PPO for lunar lander [BUG]

I'm trying to use the PPO for the lunar lander but I can't find examples and my code doesn't seem to converge, can you spot the issue? some parameter is wrong? alg = PPO ``` from mushroom_rl.policy…

davidenitti updated 1 year ago
10
lgooo/SUMO-RL-Coverage #24

Multi-step DDQN

Currently, only a single step (s, a, r, s') is considered for training DDQN. There is empirical study that multi-step training performs better: https://rayyoh.github.io/files/2017-Rainbow.pdf Let'…

kihyukh updated 2 years ago
1
philtabor/Youtube-Code-Repository #50

Error when I changed dueling_ddqn_torch.py to get multiple d…

I want to implement a dueling double DQN algorithm for selecting multiple discrete actions. Since the existing dueling_ddqn_torch.py code is for choosing a single action, I should modify it. But when …

Nazanin-87 updated 2 years ago
3
Albert-Z-Guo/Deep-Reinforcement-Stock-Trading #12

Save DDQN

Hi! Thank you for your answering! I know what you mean,but my problem is that I modify model file names there like if model_name == 'DDQN': agent.model.save('saved_models/DDQN_ep' + str(e) + '.h5')…

yesungkeke updated 2 years ago
1
huggingface/deep-rl-class #64

Why Double DQN don't use the accumulated reward

The main goal of Deep Learning is to maximize the accumulate reward. In the Q-Learn we use the accumulate reward to update the Qtable. However, the DDQN use the instant reward instead of accumulated r…

Bob-AFei updated 2 years ago
3
sezan92/sezan92.github.io #14

Blog reinforce Discrete method

## Objective After discrete reinforce method of Reinforcement learning algorithm has been implemented. The next task is to make a blog about reinforce method. This issue is to work on that ## Tas…

sezan92 updated 1 year ago
43
TradeMaster-NTU/TradeMaster #94

ETEO Algorithm and data

Hi, I am trying to figure out ETEO algorithm for OE. However, could you please provide the source paper of ETEO algorithm, an arXiv link should be helpful. Also, could you please help to explai…

DeepAnonymous updated 1 year ago
21
DLR-RM/stable-baselines3 #996

[Question] Does DQN copy `running_mean` and `running_var` of…

I have been spending quite some time reading the codes here and I have been learning quite a lot so far. I got a small question when I backtrack some codes to find out why I got some unstable agents. …

honglu2875 updated 2 years ago
5
Albert-Z-Guo/Deep-Reinforcement-Stock-Trading #11

how to save DDQN model

Hi! Thank you for your last answer! Recently I try to train the DDQN in your project,so I write" if model_name == 'DDQN': …

yesungkeke updated 2 years ago
2
PaddlePaddle/PARL #677

AttributeError: 'AtariModel' object has no attribute 'value'

examples下的DQN_variant 使用DDQN报错 readme推荐环境是 + [paddlepaddle>=2.0.0](https://github.com/PaddlePaddle/Paddle) + [parl>=2.0.1](https://github.com/PaddlePaddle/PARL) + gym==0.18.0 + tqdm + atari-py==…

237035848 updated 2 years ago
4

上一页 1...11 12 13 14 15 16 17...28 下一页

277 results for ddqn

277 results
for ddqn