awni / speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text
Apache License 2.0
747 stars, 175 forks

Loss decreases but SER increases #51

Open liangzhenlin123 opened 5 years ago

liangzhenlin123 commented 5 years ago

Hello, I trained an RNN-T model on a Chinese speech recognition corpus of more than 300 hours (the encoder was pretrained, but the decoder was randomly initialized). Over dozens of epochs, the loss first dropped quickly from more than 1000 to 60, then slowly to just over 20, but the SER at inference rose from 2 to 20. Is this normal? It seems that you mentioned this phenomenon elsewhere. Thank you very much!
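For anyone trying to reproduce this, one way to confirm the gap is to score a fixed held-out set after every epoch and compare those numbers against the training loss. The sketch below is only an illustration: the post does not define SER, so it assumes sentence error rate and also reports character error rate. The names `edit_distance` and `error_rates` are hypothetical and not part of the awni/speech code.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            cur.append(min(prev[j] + 1,              # deletion
                           cur[j - 1] + 1,           # insertion
                           prev[j - 1] + (r != h)))  # substitution or match
        prev = cur
    return prev[-1]


def error_rates(refs, hyps):
    """Return (cer, ser) in percent for paired reference/hypothesis strings.

    cer: total character edits divided by total reference characters.
    ser: share of utterances with at least one error (assumed meaning of SER).
    """
    total_edits = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_chars = sum(len(r) for r in refs)
    wrong = sum(r != h for r, h in zip(refs, hyps))
    cer = 100.0 * total_edits / max(total_chars, 1)
    ser = 100.0 * wrong / max(len(refs), 1)
    return cer, ser


if __name__ == "__main__":
    # Toy transcripts; in practice these come from decoding a dev set.
    refs = ["abcd", "efgh"]
    hyps = ["abcd", "efgx"]
    cer, ser = error_rates(refs, hyps)
    print(f"CER {cer:.1f}%  SER {ser:.1f}%")
```

Logging both numbers per epoch on data the model never trains on makes it easier to tell whether a rising error rate comes from overfitting or from a mismatch between the training objective and the decoding procedure.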