mpnunez / Connect4-AI

Training an AI Player to play Connect4
0 stars 0 forks source link

Apply gradient clipping #9

Closed mpnunez closed 2 months ago

mpnunez commented 2 months ago

Q predicitons with MSE cost function often explode when the policy gets stuck

mpnunez commented 2 months ago

This did not help when I tried it. Gradient clipping is used more for RNN and very deep networks. There is some other issue causing loss function explosion in our DQN algorithm.