alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0
7.35k stars 1.04k forks source link

Vosk for speech recognition on MCU #1564

Closed Taher0321 closed 3 days ago

Taher0321 commented 2 months ago

Hi, We are using vosk api in our voice to text conversion product which we will commercialize and sell in India and later in the world, cool?

nshmyrev commented 2 months ago

Absolutely! Let us know if you need any help, we are interested to get it running.

Taher0321 commented 2 months ago

Thanks

Taher0321 commented 1 month ago

Hi, We are facing so many difficulties in using Vosk, actually the device which will have Vosk API into it is a small wireless headphone like device, so it is being very difficult to find an MCU which can run big model like vosk, so do you: 1) Have any lighter version/solution that we can use in our device? 2) Do you suggest any MCU/microprocessor we can use that will run VOSK?

The language models on your website is very heavy, do you have an alternative solution?

Let me know if we are making any mistake here, just PL the VOSK API and put it into our device is all that needed, no need to use language models?

@nshmyrev

nshmyrev commented 1 month ago

@Taher0321 Vosk is not suitable for MCU and not designed for it as it doesn't have enough compute capabilities. If you describe your task better we can help you do decide the software. For example you can check https://github.com/ARM-software/ML-KWS-for-MCU

Taher0321 commented 1 month ago

@nshmyrev https://www.techecies.com/nekosa - This what we are building. Let me know how we can implement vosk in it.

Taher0321 commented 4 weeks ago

Hey, I am awaited for your response.

nshmyrev commented 4 weeks ago

Well, I think MCU has not enough compute capabilities for this task, you have to offload audio somewhere, at least on the phone. There are efficient recognizers for MCU, but they still limited for very small topic.

I know some people run Vosk on smart glasses, which is similar to your device, but they run full-scale Linux, not really and MCU.

Taher0321 commented 3 weeks ago

Do you think this MCU will work: https://robu.in/product/seeed-studio-xiao-esp32s3-sense-2-4ghz-wi-fi-ble-5-0-ov2640-camera-sensor-digital-microphone-battery-charge-supported-rich-interface-iot-embedded-ml/

I bet it will. What do you think?

nshmyrev commented 3 weeks ago

No. For AI you need something like https://www.mistywest.com/mistysom/

Taher0321 commented 4 days ago

Thanks but its not helping. I think latest nvidia jetson chips may help, we are studying it

nshmyrev commented 3 days ago

Ok, let us know how it goes