KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
MIT License
2.09k stars 190 forks source link

Languages supported #13

Closed Zireael07 closed 1 week ago

Zireael07 commented 11 months ago

Which languages are supported?

KoljaB commented 11 months ago

Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.

skripnik commented 11 months ago

Here you can find all the language codes that Whisper supports: https://github.com/openai/whisper/blob/main/whisper/tokenizer.py

skripnik commented 11 months ago

It would be nice to mention allowed language values in README.md. Something like this: