quetion about the loss and grad of "mbr"

hirofumi0810 / neural_sp

End-to-end ASR/LM implementation with PyTorch

Apache License 2.0

596 stars 141 forks source link

Open Cescfangs opened 3 years ago

Cescfangs commented 3 years ago

hirofumi0810 commented 3 years ago

@Cescfangs yes

Cescfangs commented 3 years ago

@Cescfangs yes

Thanks for the reply, and I'm curious about the improvement of this mWER tuning, say 5% relative wer reduction?

Cescfangs commented 3 years ago

Also, I am a little confused about the “mbr” loss, the inputs are not used in backward function, how does the grad flow to model params?