rmst / ddpg

TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)
MIT License
209 stars 64 forks source link

Output of actor will saturate #9

Open m5823779 opened 5 years ago

m5823779 commented 5 years ago

Hello~ I have some question about DDPG When my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and sigmoid), the output of actor will saturate. Here is the result what I said: https://github.com/m5823779/DDPG By the way, I use batch normalization only in my actor network. Do you know where is the problem?