alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0
7.35k stars 1.04k forks source link

Language Identification #1599

Closed andreystarenky closed 3 days ago

andreystarenky commented 6 days ago

Hello, I am wondering if there is any way to do Language Identification on the text before it is being transcribed? I am building an android app with a large model for one language, but I don't want it to pick up other speech in other languages and try and transcribe it in the main language (because it prints out nonsense of course). What is the best way to do this?

nshmyrev commented 3 days ago

Tracked in https://github.com/alphacep/vosk-api/issues/420