Open csingh27 opened 3 years ago
5Y: Actor loss increases while the critic value decreases. The (state, action) pairs are not good. Is the action not defined correctly, or is the action defined correctly but the policy network returns the wrong actions?
Things to try:
- Reduce the batch size
- Add more layers to the network
- Check the gradients
- Check the weight initialization
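For the last two items, a rough sketch of what the checks could look like is below. It is framework-agnostic plain Python and assumes the gradients and weights have already been pulled out of the networks as flat lists of floats. The function names (`grad_report`, `weight_stats`), the layer names, and the thresholds are all made up for illustration and are not from this repo.

```python
import math

def grad_report(named_grads, vanish_tol=1e-7, explode_tol=1e3):
    """Classify each layer's gradient by its L2 norm.

    named_grads: dict mapping a layer name to a flat list of gradient values.
    The tolerances are arbitrary placeholders; tune them for the actual model.
    """
    report = {}
    for name, grads in named_grads.items():
        if any(math.isnan(g) or math.isinf(g) for g in grads):
            # NaN/inf gradients usually point at a numerical problem in the loss.
            report[name] = (float("nan"), "non-finite")
            continue
        norm = math.sqrt(sum(g * g for g in grads))
        if norm < vanish_tol:
            status = "vanishing"
        elif norm > explode_tol:
            status = "exploding"
        else:
            status = "ok"
        report[name] = (norm, status)
    return report

def weight_stats(weights):
    """Return (mean, population std) of a flat list of weights."""
    n = len(weights)
    mean = sum(weights) / n
    std = math.sqrt(sum((w - mean) ** 2 for w in weights) / n)
    return mean, std

if __name__ == "__main__":
    # Hypothetical layer names and values, only to show the output format.
    grads = {"actor.fc1": [3.0, 4.0], "critic.fc1": [0.0, 0.0]}
    for layer, (norm, status) in grad_report(grads).items():
        print(layer, norm, status)
    print(weight_stats([1.0, -1.0]))
```

Running this once right after initialization and again after a few updates should show whether the actor's gradients vanish or blow up, and whether the initial weights have a sensible spread.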