Numeric underflow when loss gets small.

dhruvrajan / tensorflow-keras-java

MIT License

54 stars 9 forks source link

Open dhruvrajan opened 5 years ago

dhruvrajan commented 5 years ago

When training MNIST for > 14 epochs, there seems to be an underflow problem, resulting in NaNs when the loss becomes ~ 7E-3.