-
Not something I'd do for LZ right now, but an interesting idea to push things further:
Someone on the Leela Chess Zero list asked:
>Hello, I still can not understand the reason for using both Po…
-
I was trying to reproduce this work, but I ran into a "No module named 'env'" error. I checked the .gitignore file and found that it contains env/, so I suspect that is where the error comes from.
(…
-
### Describe the problem
The results for R2D2 are quite good: https://openreview.net/forum?id=r1lyTjAqYX
We should add this as a variant of Ape-X DQN that supports recurrent networks. The high-l…
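For anyone new to R2D2 (this is an illustrative sketch, not RLlib code): the "recurrent networks" part means replacing the feed-forward Q-network of Ape-X DQN with one that threads an LSTM hidden state through each replayed sequence. A minimal PyTorch version, with made-up names and sizes, might look like:

```python
# Hedged sketch of a recurrent Q-network of the kind R2D2 uses.
# All names, sizes, and the surrounding training loop are illustrative.
import torch
import torch.nn as nn

class RecurrentQNetwork(nn.Module):
    def __init__(self, obs_dim, num_actions, hidden_size=256):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, hidden_size)
        self.lstm = nn.LSTM(hidden_size, hidden_size, batch_first=True)
        self.q_head = nn.Linear(hidden_size, num_actions)

    def forward(self, obs_seq, hidden_state=None):
        # obs_seq: (batch, time, obs_dim); hidden_state carries context
        # across the steps of a replayed sequence.
        x = torch.relu(self.encoder(obs_seq))
        x, hidden_state = self.lstm(x, hidden_state)
        q_values = self.q_head(x)  # (batch, time, num_actions)
        return q_values, hidden_state
```

R2D2 additionally stores the recurrent state alongside each sampled sequence and "burns in" the first part of the sequence to refresh the hidden state before computing the loss on the remainder.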
-
The reward plot shown above is decreasing rather than increasing over time. This could be due to the hyperparameters chosen, or to how the state features are preprocessed (a normalization sketch follows the ideas below).
Some ideas to try:
- Preproc…
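One concrete version of the preprocessing point, as my own illustration rather than anything from the issue: normalize each state feature with a running mean and standard deviation so that features on very different scales don't dominate the value estimates. The class and variable names below are hypothetical.

```python
import numpy as np

class RunningNormalizer:
    """Keeps a running mean/variance per state feature and scales observations.
    Illustrative sketch only; how it is wired into the agent is up to the user."""

    def __init__(self, obs_dim, eps=1e-8):
        self.count = eps
        self.mean = np.zeros(obs_dim)
        self.var = np.ones(obs_dim)
        self.eps = eps

    def update(self, obs):
        # Incremental (Welford-style) update of the per-feature statistics.
        self.count += 1
        delta = obs - self.mean
        self.mean += delta / self.count
        self.var += (delta * (obs - self.mean) - self.var) / self.count

    def normalize(self, obs):
        return (obs - self.mean) / np.sqrt(self.var + self.eps)

# Usage sketch: call update(state) on each observed state and feed
# normalize(state) to the Q-network instead of the raw features.
```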
-
Hi Denny,
Thanks for this wonderful resource. It's been hugely helpful. Can you say what your results are when training the DQN solution? I've been unable to reproduce the results of the DeepMind p…
-
Hi. Thank you for everything, this is great. I would like to ask whether, based on your work, I can swap the TD3 algorithm for other algorithms such as DQN, DDQN, and so on. I don't know much about this. I …
-
## Describe the bug
For an academic project, I wanted to compare a few versions of DQN:
- Vanilla DQN
- DQN with a target network
- Double DQN (therefore with a target network)
By looking into…
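To make the difference between the three variants concrete (this is my own sketch, not code from the project): the only change between them is how the bootstrap target is computed. The tiny networks and the dummy minibatch below are purely illustrative.

```python
import torch
import torch.nn as nn

# Illustrative setup: a tiny Q-network over 4-dim states and 2 actions.
q_net = nn.Linear(4, 2)
target_net = nn.Linear(4, 2)
target_net.load_state_dict(q_net.state_dict())  # periodically copied in practice

gamma = 0.99
next_states = torch.randn(32, 4)  # dummy sampled minibatch
rewards = torch.randn(32)
dones = torch.zeros(32)

with torch.no_grad():
    # Vanilla DQN: bootstrap with the online network itself (no target net).
    vanilla = rewards + gamma * (1 - dones) * q_net(next_states).max(dim=1).values

    # DQN with a target network: bootstrap with the frozen copy.
    with_target = rewards + gamma * (1 - dones) * target_net(next_states).max(dim=1).values

    # Double DQN: the online net selects the action, the target net evaluates it.
    next_actions = q_net(next_states).argmax(dim=1, keepdim=True)
    double_dqn = rewards + gamma * (1 - dones) * target_net(next_states).gather(1, next_actions).squeeze(1)
```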
-
**Describe the bug**
A brief description of the bug and in which notebook/script it lives.
04_q_learning_for_trading
Train Agent
DDQNAgent.experience_replay()
q_values[[self.idx, action…
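For context on the quoted line, here is my own minimal illustration of the indexing pattern it uses (idx, actions, and targets are made-up names standing in for self.idx, the sampled actions, and the TD targets): each target is written into the row of its sample and the column of the action actually taken, leaving the other action entries untouched before the network is fit.

```python
import numpy as np

batch_size, num_actions = 4, 3
q_values = np.zeros((batch_size, num_actions))   # predicted Q-values for a minibatch

idx = np.arange(batch_size)                      # row index per sample (cf. self.idx)
actions = np.array([2, 0, 1, 2])                 # action taken in each transition
targets = np.array([1.0, 0.5, -0.2, 0.8])        # computed TD targets

# Pairwise indexing: element (i, actions[i]) receives targets[i].
q_values[idx, actions] = targets
```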
-
I tried to run a test case (e.g. /workspaces/mpc-drl-tl/testcases/gym-environments/single-zone/test_action_v1/test_ddqn_tianshou.py) using the cpu3 image, but it gave a permission error message as fol…