Open marvin-nj opened 5 years ago
Hello , Thank you for sharing ! the Gradient of my d_fk_loss disappear when i train my chinese experiment in the 22 epochs ,all config is initial.
so, what is wrong with it ?
Hello , Thank you for sharing ! the Gradient of my d_fk_loss disappear when i train my chinese experiment in the 22 epochs ,all config is initial.
so, what is wrong with it ?