shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
MIT License
295 stars 30 forks source link

Language Auto Detection #1

Open brunjo opened 10 months ago

brunjo commented 10 months ago

I noticed that it would fallback to English if no language is specified. Is there a way to automatically predict the language?

shashikg commented 10 months ago

Hi @brunjo, yes, if you don't provide the language, it will fall back to English. I'll try adding that. However, the issue with this approach is that it complicates the batching of segments across multiple files.

AmgadHasan commented 3 months ago

Hi! Any update about this? I think it will need one additional step where we get the language before continuing decoding