germain-hug / Deep-RL-Keras

Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
528 stars 149 forks

Actor update equation in DDPG #5

Closed ghost closed 5 years ago

ghost commented 5 years ago

Hi,

When you use the gradient of the critic to update the actor here, why do you pass "-action_gdts" rather than "action_gdts" as the third parameter of tf.gradients()? Where does the minus sign come from?

I double-checked the formula and I still don't see why it is the case in your code.

Thanks!

germain-hug commented 5 years ago

Apologies for the late response. The negation here is a simple trick: Keras optimizers always *minimize* a loss, while the DDPG actor update calls for gradient *ascent* on the critic's Q-value. Passing "-action_gdts" as the grad_ys argument of tf.gradients() negates the chained gradient, so the optimizer's descent step effectively ascends Q. This is similar to what OpenAI's baselines do.
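A minimal scalar sketch of the trick (hypothetical toy functions, not the repo's actual code): an optimizer step subtracts the gradient, so negating the critic's action gradient before chaining it through the actor turns descent into ascent on Q.

```python
# Toy setup: scalar actor parameter theta, action a = 2 * theta, and a
# critic Q(a) = -(a - 4)^2, which is maximal at a = 4, i.e. theta = 2.
def action(theta):
    return 2.0 * theta

def dQ_da(a):
    """Critic gradient w.r.t. the action, analogous to action_gdts."""
    return -2.0 * (a - 4.0)

theta, lr = 0.0, 0.05
for _ in range(200):
    a = action(theta)
    action_gdts = dQ_da(a)
    # A descent step does theta -= lr * grad. Feeding the *negated* action
    # gradient (the "-action_gdts" passed to tf.gradients) flips the sign,
    # so the minimizing step ascends Q instead:
    grad_wrt_theta = 2.0 * (-action_gdts)  # chain rule: da/dtheta = 2
    theta -= lr * grad_wrt_theta           # descent on -Q == ascent on Q

print(round(theta, 6))  # converges to 2.0, the Q-maximizing parameter
```

Without the minus sign, the same loop would descend Q and push theta away from the maximizer.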

ghost commented 5 years ago

Thanks!