alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0

where to get the dictionary of vosk-model-en-us-0.22-lgraph #1555

Open YangangCao opened 2 months ago

YangangCao commented 2 months ago

Hi, dear author, I want to get the dictionary of vosk-model-en-us-0.22-lgraph so I can check every phone in a word. Where can I get it? Thanks very much.

nshmyrev commented 2 months ago

It is inside the compilation package

https://alphacephei.com/vosk/models/vosk-model-en-us-0.22-compile.zip

YangangCao commented 2 months ago

OK, got it. Thanks for your quick and accurate reply!

YangangCao commented 2 months ago

Hi, sorry to bother you. I find some phones hard to read, for example:

electromagnetic electromagnetic @_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E
electromagnetic electromagnetic @_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E
electromagnetic electromagnetic I_B l_I E_I k_I 4_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E
electromagnetic electromagnetic I_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E

What's 4_I? Also, the word "electromagnetic" has 4 different phone sequences. I want to calculate GOP (goodness of pronunciation); how do I decide on a single phone sequence?

nshmyrev commented 2 months ago

> what's 4_I ?

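4 is a SAMPA phone. In X-SAMPA it denotes the alveolar flap, the flapped "t" of American English, which is why it alternates with t in the variants you listed.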

_I marks a word-internal phone (Kaldi appends position suffixes such as _B for word-begin and _E for word-end), so you are probably looking at the intermediate lexicon instead of the original one.
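To read the entries quoted above more easily, the position suffixes can be stripped off. The following is only a minimal sketch: it assumes the "word word phone ..." layout of the lines in this thread and the usual Kaldi suffix set (_B, _I, _E, _S); the helper names `strip_position` and `parse_entry` are made up for illustration.

```python
# Kaldi word-position suffixes: begin, internal, end, singleton (assumed set).
POSITION_SUFFIXES = ("_B", "_I", "_E", "_S")


def strip_position(phone):
    """Return the base phone without a word-position suffix."""
    for suffix in POSITION_SUFFIXES:
        if phone.endswith(suffix):
            return phone[: -len(suffix)]
    return phone  # phones without a suffix are returned unchanged


def parse_entry(line):
    """Parse one 'word word phone ...' line as quoted in this thread."""
    fields = line.split()
    word, phones = fields[0], fields[2:]  # the word appears twice; skip the repeat
    return word, [strip_position(p) for p in phones]


line = ("electromagnetic electromagnetic @_B l_I E_I k_I t_I r_I oU_I "
        "m_I {_I g_I n_I E_I 4_I I_I k_E")
word, phones = parse_entry(line)
print(word, " ".join(phones))
# prints: electromagnetic @ l E k t r oU m { g n E 4 I k
```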

> I want to calculate GOP(goodness of pronunciation), how to decide the only one phone arrangement?

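Run forced alignment. The aligner considers all pronunciation variants in the lexicon and selects the one that best matches the audio.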
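Once an alignment is available, finding out which variant was chosen is a plain lookup. The sketch below assumes the aligned phone sequence for the word has already been extracted from the aligner output; the `aligned` list here is invented for illustration and is not real alignment output, and `pick_variant` is a hypothetical helper.

```python
# The four lexicon variants quoted earlier in this thread.
VARIANTS = [
    "@_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E",
    "@_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E",
    "I_B l_I E_I k_I 4_I r_I oU_I m_I {_I g_I n_I E_I 4_I I_I k_E",
    "I_B l_I E_I k_I t_I r_I oU_I m_I {_I g_I n_I E_I t_I I_I k_E",
]


def pick_variant(aligned_phones, variants):
    """Return the index of the variant equal to the aligned phones, or None."""
    for index, variant in enumerate(variants):
        if variant.split() == aligned_phones:
            return index
    return None


# Invented example: pretend the aligner chose the third variant.
aligned = VARIANTS[2].split()
print(pick_variant(aligned, VARIANTS))
```

The phones of the selected variant are then the ones to score when computing GOP for that word.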

YangangCao commented 2 months ago

Thanks for your reply, it helps me a lot.

YangangCao commented 2 months ago

Hi dear author, why does Vosk use a different phone system from Kaldi? For example, "@", "{", and "4" appear in the Vosk model but not in the Kaldi model. Any idea how to map from one to the other?

nshmyrev commented 2 months ago

Unfortunately, that particular model uses a different phoneset. You can still map it easily; it is a simple mapping. Other models, like gigaspeech, use the standard CMU dictionary.
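For illustration, such a mapping could be a lookup table. This is a rough sketch, not the model's actual phone table: it covers only the symbols that appear in this thread, reads them as X-SAMPA, and maps them to stress-less ARPAbet (the CMU dictionary symbol set). The choice of T for the flap "4" is an assumption, since the CMU dictionary has no separate flap symbol; `to_arpabet` is a hypothetical helper.

```python
# Partial, assumed mapping from the symbols seen in this thread to ARPAbet.
SAMPA_TO_ARPABET = {
    "@": "AH",   # schwa; the CMU dictionary writes it as unstressed AH
    "{": "AE",   # vowel of "cat"
    "E": "EH",   # vowel of "bed"
    "I": "IH",   # vowel of "bit"
    "oU": "OW",  # vowel of "go"
    "4": "T",    # alveolar flap; T is an assumed stand-in
    "l": "L", "k": "K", "t": "T", "r": "R",
    "m": "M", "g": "G", "n": "N",
}


def to_arpabet(phones):
    """Map phones (position suffixes already removed) to ARPAbet symbols."""
    # A KeyError here points at a phone that still needs a table entry.
    return [SAMPA_TO_ARPABET[p] for p in phones]


phones = "@ l E k t r oU m { g n E 4 I k".split()
print(" ".join(to_arpabet(phones)))
# prints: AH L EH K T R OW M AE G N EH T IH K
```

A full table would need one entry per phone in the model's phone list, which is not shown in this thread.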

YangangCao commented 2 months ago

Thanks for your quick and accurate reply. I know about the gigaspeech model now and it is good enough for me, so I don't plan to map the phoneset. Thanks!