AlphaZeroIncubator / AlphaZero

Our implementation of AlphaZero for simple games such as Tic-Tac-Toe and Connect4.
0 stars 0 forks source link

Add custom loss function. Closes #1 #22

Closed guidopetri closed 4 years ago

guidopetri commented 4 years ago

L2 loss is being left to the optimizer, since it would require unpacking the parameters of the network every time (as far as I know). I really wanted to make it part of the loss function itself, but I can't find a way to do it otherwise.