jnhwkim closed this 7 years ago
If the target is zero, the loss and the gradient with respect to the input remain zero. This is useful for variable-length natural-language generation models, where zero targets can mark padded positions.
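To illustrate the behavior described above, here is a minimal sketch in plain Python. It is not the code from this pull request: it assumes a negative log-likelihood style criterion with 1-based class indices, an unnormalized (summed) loss, and a hypothetical function name `masked_nll`. Positions whose target is 0 contribute nothing to the loss and receive an all-zero gradient row.

```python
import math


def masked_nll(log_probs, targets):
    """Summed negative log-likelihood that skips zero targets.

    log_probs: list of rows, each a list of log-probabilities per class.
    targets:   list of 1-based class indices; 0 marks a padded position
               (assumed convention for this sketch).
    Returns (loss, grad), where grad has the same shape as log_probs.
    """
    loss = 0.0
    grad = [[0.0] * len(row) for row in log_probs]
    for i, (row, target) in enumerate(zip(log_probs, targets)):
        if target == 0:
            # Padded position: leave the loss and this gradient row at zero.
            continue
        loss -= row[target - 1]
        grad[i][target - 1] = -1.0
    return loss, grad


# Two time steps; the second one is padding (target 0).
example_log_probs = [
    [math.log(0.5), math.log(0.5)],
    [math.log(0.25), math.log(0.75)],
]
example_loss, example_grad = masked_nll(example_log_probs, [1, 0])
```

With this convention, sequences of different lengths can be padded to a common length with zero targets, and the padded steps have no effect on training.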
@jnhwkim Thanks!
I think nn should be consistent with this. I'll look into it shortly.
@jnhwkim I am working on an nn version of this: https://github.com/torch/nn/pull/1215