alphacep / vosk-unity-asr

Automatic Speech Recognition in Unity using Vosk library
56 stars 16 forks source link

Cannot transcribe speech to text for model without RNNLM and rescore #4

Closed yli223 closed 1 year ago

yli223 commented 1 year ago

Hey, I implement a unity project that uses Vosk. The current model is around 5GB, which performs really well, but it is too large. It takes around 5 seconds to give the answer through Unity. So I remove RNNLM and rescore folders in my model. It works fine when I run it through python, which means the smaller model can give me the expected text based on my speech. But after integrating this model into unity, it cannot recognize the same input speech. I wonder if the model without RNNLM and rescore folders is supported in the unity project? If so, can anyone point me out why the model cannot perform the same as the python project?

nshmyrev commented 1 year ago

Removal of rnnlm and rescore should not affect results much, you probably changed some other things in process like data processing. I suggest you to start with a clean state again.