Open Bigpig4396 opened 5 years ago
Yes, the problem is that the activation function is chosen incorrectly.
I don't think this repo implement the PPO correctly either
change the activation function relu to tanh
right,change relu to tanh in actor network
Yes, the problem is that the activation function is chosen incorrectly.