NaN Loss on train - Githubissues

bglick13 commented 7 years ago

I have been getting NaN losses when I try to train my agent on a game. I tracked it back to the get_batch function in memory - Y (the output of the model's predictions) turns to all NaNs about halfway through the first epoch. I haven't been able to figure it out from there, though.

Any suggestion would be much appreciated. This package is fantastic!

farizrahman4u commented 7 years ago

Please specify the model you are using.

bglick13 commented 7 years ago

Hi, thank you for the response.

I am using essentially the example from the read me, but with my own game. I figured out if I lower the learning rate dramatically to .001, it fixes the problem. Could there be something in the way I've designed my game that would cause the discrepancy? I'd rather use a slightly larger learning rate if possible.

Thanks again

farizrahman4u commented 7 years ago

Some sort of normalization of your reward might help. Paste your code here, I will take a look.

IntQuant commented 7 years ago

I found that in logs before loss became nan

code.py.zip

farizrahman4u / qlearning4k

NaN Loss on train #13