skit-ai / kaldi-serve

Server framework for Kaldi ASR Toolkit
Apache License 2.0
97 stars 24 forks source link

Improve confidence scoring algorithm #33

Open greed2411 opened 2 years ago

greed2411 commented 2 years ago

To be updated.

pskrunner14 commented 2 years ago

Improve on the current confidence scoring algorithm here. The current confidence scores aren't very informative in terms of relative importance of the resulting alternatives. This is because Kaldi's AM and LM scores are scaled and unbounded negative log-likelihoods. Write an algorithm to compute the aggregate %confidence scores that can help improve performance on downstream tasks like SLU by removing bat alternatives, e.g. confidence scores similar to that of Google ASR.

Relevant links:

  1. kaldi-help thread about confidence scores