shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
MIT License
318 stars 32 forks source link

Language Auto Detection #1

Open brunjo opened 11 months ago

brunjo commented 11 months ago

I noticed that it would fallback to English if no language is specified. Is there a way to automatically predict the language?

shashikg commented 11 months ago

Hi @brunjo, yes, if you don't provide the language, it will fall back to English. I'll try adding that. However, the issue with this approach is that it complicates the batching of segments across multiple files.

AmgadHasan commented 4 months ago

Hi! Any update about this? I think it will need one additional step where we get the language before continuing decoding

ustclan commented 3 weeks ago

My solution is to refer to how WhisperX does it and then transplant it accordingly.