Hi,
I am testing the LG result in K2, but i found the result is bad and something cases are unusual. The result can be divided into two case.
Empty and Incomplete, like #403 .
Env setting:
zipformer, fast_beam_search_nbest_LG, and OOV word is not very much.
lastest K2 and icefall, logadd by default, like #277.
a. Empty
the whole decode result is empty, and in some cases, this can be alleviated when degrading the lm scale. unlike #277, the OOV is not the leading factor of this phenomenon.
e.g. src = ['这', ‘个’, ‘多’, ‘少’, ‘钱’, ‘啊’]
lm-scale-0.1-hyp = ['这', ‘个’, ‘多’, ‘钱’, ‘啊’]
lm-scale-0.15-hyp = []
lm-scale-0.2-hyp = []
...
b. Incomplete
the result result is incomplete. This can be found in the beginning, end and middle of a uttrance.
Also, if the avg duration of the testset audio & text is longer, the Incomplete case will much more common, resulting much more del error.
Hi, I am testing the LG result in K2, but i found the result is bad and something cases are unusual. The result can be divided into two case. Empty and Incomplete, like #403 .
Env setting: zipformer, fast_beam_search_nbest_LG, and OOV word is not very much. lastest K2 and icefall, logadd by default, like #277.
a. Empty the whole decode result is empty, and in some cases, this can be alleviated when degrading the lm scale. unlike #277, the OOV is not the leading factor of this phenomenon. e.g. src = ['这', ‘个’, ‘多’, ‘少’, ‘钱’, ‘啊’] lm-scale-0.1-hyp = ['这', ‘个’, ‘多’, ‘钱’, ‘啊’] lm-scale-0.15-hyp = [] lm-scale-0.2-hyp = [] ...
b. Incomplete the result result is incomplete. This can be found in the beginning, end and middle of a uttrance. Also, if the avg duration of the testset audio & text is longer, the Incomplete case will much more common, resulting much more del error.