shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
MIT License
315 stars 32 forks source link

speaker diarization #35

Open tristan-mcinnis opened 9 months ago

tristan-mcinnis commented 9 months ago

Thanks for putting so much work into this, its so polished already!

Just want to understand if speaker diarization is something planned in the future?

Thanks!

shashikg commented 8 months ago

Hi @nexuslux thanks for the interest. Speaker diarization is not planned somewhere near, I may add it later on. Current priorities are:

Once these are done, I will see if speaker diarization is something we should include in WhisperS2T.