-
Thanks for sharing this wonderful code, but I have a question.
1. Why, in the combining part of the equation, does the advantage A need to subtract its average? I've already referred to the paper but sti…
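For context: the mean is subtracted to resolve an identifiability issue. V and A are only determined up to a constant, because adding a constant to V and subtracting the same constant from A leaves Q unchanged; forcing the advantages to have zero mean per state pins the decomposition down (Wang et al., 2016). A minimal PyTorch sketch of the combining step, with illustrative tensor names that are not taken from this repository:
```
import torch

def combine(value: torch.Tensor, advantage: torch.Tensor) -> torch.Tensor:
    # value:     (batch, 1)         state value V(s)
    # advantage: (batch, n_actions) advantages A(s, a)
    # Subtracting the per-state mean advantage makes the decomposition
    # identifiable: without it, V and A could shift by any constant c.
    return value + advantage - advantage.mean(dim=1, keepdim=True)
```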
-
I changed the parameters in examples/dqn.py to this and I get an error:
```
def main():
    env_name = 'CartPole-v1'
    # env_name = 'PongNoFrameskip-v4'
    use_prioritization = True
    use_…
-
Hello, I am new to this field, and my question is: where can I find the structure of the neural network for the Dueling DQN algorithm in your code?
Thanks very much!
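For reference, a Dueling DQN usually splits into a value stream and an advantage stream after a shared feature extractor, then recombines them with the mean-subtracted advantage. A minimal PyTorch sketch; the class name, layer sizes, and hidden width are illustrative and not taken from this repository:
```
import torch
import torch.nn as nn

class DuelingQNet(nn.Module):
    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 128):
        super().__init__()
        # Shared feature extractor.
        self.features = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        # Two heads: state value V(s) and per-action advantages A(s, a).
        self.value = nn.Linear(hidden, 1)
        self.advantage = nn.Linear(hidden, n_actions)

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        h = self.features(obs)
        v = self.value(h)
        a = self.advantage(h)
        # Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)   (Wang et al., 2016)
        return v + a - a.mean(dim=1, keepdim=True)
```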
-
Hi, if I run the code for Breakout, I am getting the following error.
Traceback (most recent call last):
  File "main.py", line 120, in <module>
    main()
  File "main.py", line 117, in main
    atari_…
-
**Important Note: We do not do technical support or consulting** and don't answer personal questions by email.
Please post your question on the [RL Discord](https://discord.com/invite/xhfNqQv), [R…
-
When using demo_DQN_Dueling_Double_DQN, the .pt file saved at the end of training cannot be used as the weights file at test time. Does saving the .pt file need to be changed from
torch.save(actor, actor_path)
to
torch.save(actor.state_dict(), actor_path)?
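For what it's worth, saving the state_dict is the approach the PyTorch documentation recommends: torch.save(actor, actor_path) pickles the whole module object, so loading requires the exact class definition and module path to be importable, while a state_dict only stores the parameters. A minimal sketch of the round trip; Actor here is a hypothetical stand-in for the repository's actual network:
```
import torch
import torch.nn as nn

class Actor(nn.Module):  # hypothetical stand-in for the real network
    def __init__(self):
        super().__init__()
        self.net = nn.Linear(4, 2)

    def forward(self, x):
        return self.net(x)

actor = Actor()
actor_path = "actor.pt"

# Save only the parameters, not the pickled module object.
torch.save(actor.state_dict(), actor_path)

# At test time: rebuild the same architecture, then load the weights.
test_actor = Actor()
test_actor.load_state_dict(torch.load(actor_path))
test_actor.eval()
```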
-
From my understanding, the target network updates are implemented wrong in the notebook Double-Dueling-DQN.ipynb, as the target network is updated on the same step as the main network (every 4th). In this simple environmen…
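For context, the common pattern is to take a gradient step on the online network every few environment steps, but to copy (or Polyak-average) into the target network on a much longer period. A minimal sketch of the two schedules; the interval values and the stand-in networks are illustrative, not taken from the notebook:
```
import torch.nn as nn

TRAIN_EVERY = 4            # gradient step every 4 environment steps
TARGET_SYNC_EVERY = 1000   # hard copy into the target network far less often

online_net = nn.Linear(4, 2)   # stand-in for the Q-network
target_net = nn.Linear(4, 2)
target_net.load_state_dict(online_net.state_dict())

for step in range(10_000):
    # ... act in the environment and store the transition ...
    if step % TRAIN_EVERY == 0:
        pass  # one SGD step on the online network using a sampled minibatch
    if step % TARGET_SYNC_EVERY == 0:
        target_net.load_state_dict(online_net.state_dict())
```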
-
In the section introducing Dueling DQN, it says: "In the same state, the advantage values of all actions sum to 0, because the expectation of the action values of all actions is the state value of that state." My understanding is that the expectation of the advantage values of all actions under the policy π is 0, rather than their sum being 0. I'm not sure whether my understanding is correct.
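For reference, the reading as an expectation matches the standard definitions. Since V^π(s) = E_{a∼π}[Q^π(s,a)] and A^π(s,a) = Q^π(s,a) − V^π(s), the advantage has zero expectation under π; this coincides with a zero sum only when π is uniform over actions:
```latex
\mathbb{E}_{a \sim \pi(\cdot \mid s)}\big[A^{\pi}(s,a)\big]
  = \mathbb{E}_{a \sim \pi(\cdot \mid s)}\big[Q^{\pi}(s,a)\big] - V^{\pi}(s)
  = V^{\pi}(s) - V^{\pi}(s) = 0
```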
-
{
    "base_config": "configs/HighwayEnv/agents/DQNAgent/ddqn.json",
    "model": {
        "type": "EgoAttentionNetwork",
        "embedding_layer": {
            "type": "MultiLayerPerceptron",…