Waikato / wekaDeeplearning4j

Weka package for the Deeplearning4j java library
https://deeplearning.cms.waikato.ac.nz/
GNU General Public License v3.0
184 stars 197 forks source link

NaNs in model output, likely caused by arithmetic underflow #72

Open andriuslauraitis opened 2 years ago

andriuslauraitis commented 2 years ago

Hello,

I'm getting this error in deep neural network training. How can i retune the network to avoid this error?

Best regards, Andrius L.

basedrhys commented 2 years ago

Hi,

Unfortunately, we haven't found an easy solution - the things you can try is using a different learning rate or pretrained model.

Best, Rhys