请教：语种分类为什么不直接用softmax输出？

Snowdar / asv-subtools

An Open Source Tools for Speaker Recognition

Apache License 2.0

587 stars 135 forks source link

Closed jjjjohnson closed 2 years ago

jjjjohnson commented 2 years ago

你好！我看olr2021-baseline 语种分类用了xvector embedding + LDA + LR 的方法，但是xvector在训练的时候用 softmax 输出每个语种的概率计算CE进行训练的。为什么在inference的时候不直接用xvector 的 softmax的输出？

谢谢！

Snowdar commented 2 years ago

你好，这样也是可以的，但至于谁效果更好，则需要自行调试了。其中，用softmax的概率意味着基本只能是闭集预测，而后端通过模版匹配的方式提供了注册-验证机制，这样增加了更多的可能性。祝好！

jjjjohnson commented 2 years ago

很有帮助！谢谢解答！