the LG result is weird - Githubissues

Hi, I am testing the LG result in K2, but i found the result is bad and something cases are unusual. The result can be divided into two case. Empty and Incomplete, like #403 .

Env setting: zipformer, fast_beam_search_nbest_LG, and OOV word is not very much. lastest K2 and icefall, logadd by default, like #277.

a. Empty the whole decode result is empty, and in some cases, this can be alleviated when degrading the lm scale. unlike #277, the OOV is not the leading factor of this phenomenon. e.g. src = ['这'， ‘个’， ‘多’， ‘少’， ‘钱’， ‘啊’] lm-scale-0.1-hyp = ['这'， ‘个’， ‘多’， ‘钱’， ‘啊’] lm-scale-0.15-hyp = [] lm-scale-0.2-hyp = [] ...

b. Incomplete the result result is incomplete. This can be found in the beginning, end and middle of a uttrance. Also, if the avg duration of the testset audio & text is longer, the Incomplete case will much more common, resulting much more del error.

k2-fsa / icefall

the LG result is weird #839