saharmor / whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
MIT License
777 stars 140 forks source link

Manage speakers manually for real-time refinement & avoiding speaker swapping #27

Open ethanzrd opened 1 year ago

ethanzrd commented 1 year ago

Managing speakers manually will allow to signal duplicate speaker detections to the client when using the real-time mode, and to mitigate speaker swappings when using the sequential mode.