Closed 5exceptions-rakeshdiwan closed 6 months ago
Big models are created for big servers, not for mobile phones.
To get suggestions on accuracy improvement you need to share sample audio data.
Can I share audio and other details here or we can use private email thread for that? Please assist me for this.
You can share here.
@nshmyrev Due to maintain confidentiality of work I’m not able to attach audio file here in this chat. Instead of it I’m sharing a drive link for all the required files in a folder. Please request for access on the same or give me email for access.
Folder contains:
Links
Thanks in advance.
It doesn't allow me to download, says I need to request access.
It doesn't allow me to download, says I need to request access.
The link is open can you please try now?
Well, you need better microphone. The current one is awful and cuts audio at 3khz. It has nothing about model size.
It has nothing about model size.
Can I have any reference audio file?
I've managed to record the phrases again please check and let me know if the cuts and pause time are correct. https://drive.google.com/drive/folders/1xtU0eSp8uf0wEfXHwRyIOw5pvxiCu0H8?usp=sharing
Now the audio quality is much better
Great, But I'm still facing the accuracy issue
@nshmyrev! Can you please advise us on accuracy and how can we resolve this?
Happy New Year @nshmyrev!
I'm the 5exceptions client :)
We'd greatly appreciate your guidance on how to improve the accuracy of these commands to support the voice activation-based app we're working on. Please let us know if there's a different preferred way to provide these audio files or anything else that will help move this to resolution. Ideally, this will be an approach that we can repeat when adding commands going forward.
Thank you!
@gvoll you can bias model to specific commands, see
https://alphacephei.com/vosk/lm
https://github.com/alphacep/vosk-api/blob/master/python/example/colab/vosk-adaptation.ipynb
Unable to integrate "vosk-model-en-us-0.22" language model with assets because of it's large size.
I'm working on one of the requirement of Android application and trying to integrate the large model into application package, I've encountered accuracy issues with small model so I switched to large ones. Please find details below about what I've tried so far:
java.lang.OutOfMemoryError: Java heap space
) with large models while placing model into assets directory.Please assits me to get more proper way to integrate VOSK into android or flutter to get the more accuracy at runtime.