Open 1204271075 opened 5 years ago
hi , when i train the No pre-trained language model ,why the nll loss is Nan sometimes ?
hi , when i train the No pre-trained language model ,why the nll loss is Nan sometimes ?