saharmor / whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
MIT License
775 stars 141 forks source link

Use pyannote-audio for speaker diarization #10

Open saharmor opened 1 year ago

saharmor commented 1 year ago

Logic will be to combine Whisper + pyannote.audio based on timestamps to output something along the lines of:

Person A: Hi
Person B: Hello, how are you
Person A: I'm good, and you?
....
remic33 commented 1 year ago

I am working on the same subject, you can find work done by Majdoddin here: https://github.com/Majdoddin/nlp Not perfect but a good way to start. I ll push my solution when done

saharmor commented 1 year ago

@remic33 Cool - any plans to build into Whisper Playground?