shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
MIT License
239 stars 21 forks source link

speaker diarization #35

Open nexuslux opened 4 months ago

nexuslux commented 4 months ago

Thanks for putting so much work into this, its so polished already!

Just want to understand if speaker diarization is something planned in the future?

Thanks!

shashikg commented 4 months ago

Hi @nexuslux thanks for the interest. Speaker diarization is not planned somewhere near, I may add it later on. Current priorities are:

Once these are done, I will see if speaker diarization is something we should include in WhisperS2T.