KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
MIT License
1.64k stars 151 forks source link

Languages supported #13

Open Zireael07 opened 9 months ago

Zireael07 commented 9 months ago

Which languages are supported?

KoljaB commented 9 months ago

Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.

skripnik commented 8 months ago

Here you can find all the language codes that Whisper supports: https://github.com/openai/whisper/blob/main/whisper/tokenizer.py

skripnik commented 8 months ago

It would be nice to mention allowed language values in README.md. Something like this: