[Feature Request] Constrain Available Languages when Autodetecting Language

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

MIT License

12.59k stars 1.05k forks source link

[Feature Request] Constrain Available Languages when Autodetecting Language #1164

Open WesleyFister opened 10 hours ago

WesleyFister commented 10 hours ago

Currently Faster-Whisper only allows you to specify a single language or attempt to detect the language out of a pool of 94 languages. I would like to be able to limit what languages can be detected. Something like the following to limit autodetection to only English, Spanish and French. model.transcribe("audio.mp3", beam_size=5, language=["en", "es", "fr"])