Closed huyangc closed 7 years ago
I set the parameters as : beta:1000 margin:4 scale=0.9997 beta_min=5
And after some iteration, the cross entropy loss suddenly become nan. Using the C++ layer and compile with mxnet. Does anyone have idea of this situation and how to solve it?
When using caffe code with the same parameter of asoftmax, model and dataset, it will be all right.
I set the parameters as : beta:1000 margin:4 scale=0.9997 beta_min=5
And after some iteration, the cross entropy loss suddenly become nan. Using the C++ layer and compile with mxnet. Does anyone have idea of this situation and how to solve it?