Carsonthemonkey / GIST

App to summarize audio files for the LC ACM spring 2023 hackathon
MIT License
3 stars 0 forks

Add speaker diarization #91

Open Carsonthemonkey opened 1 year ago

Carsonthemonkey commented 1 year ago

Differentiating between speakers in the transcript would be a pretty helpful feature. Someone made an implementation using whisper and pyannote here. There is also whisperX, which might be easier if I can get it working.
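
For reference, the whisper plus pyannote approach usually comes down to one merge step: the transcriber gives timestamped text segments, the diarizer gives timestamped speaker turns, and each segment gets the speaker whose turn overlaps it the most. Below is a rough sketch of just that merge step in plain Python. It is not the linked implementation and does not call whisper, pyannote, or whisperX; the tuple shapes, the `label_segments` name, and the `SPEAKER_00`-style labels are assumptions for illustration.

```python
from typing import List, Tuple

# Hypothetical input shapes (assumptions, not any library's real output types):
#   Segment = (start_sec, end_sec, text)      from the transcription step
#   Turn    = (start_sec, end_sec, speaker)   from the diarization step
Segment = Tuple[float, float, str]
Turn = Tuple[float, float, str]


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds shared by two time intervals (0.0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(
    segments: List[Segment], turns: List[Turn], unknown: str = "UNKNOWN"
) -> List[Tuple[str, str]]:
    """Assign each transcript segment the speaker whose turn overlaps it most.

    Segments that overlap no turn at all are labeled with `unknown`.
    """
    labeled = []
    for seg_start, seg_end, text in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn_start, turn_end, speaker in turns:
            shared = overlap(seg_start, seg_end, turn_start, turn_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labeled.append((best_speaker, text.strip()))
    return labeled


if __name__ == "__main__":
    # Made-up example data, only to show the shape of the result.
    segments = [(0.0, 2.0, " Hello there."), (2.0, 5.0, " Hi, how are you?")]
    turns = [(0.0, 2.2, "SPEAKER_00"), (2.2, 6.0, "SPEAKER_01")]
    for speaker, text in label_segments(segments, turns):
        print(f"{speaker}: {text}")
```

This loop is quadratic in the number of segments and turns, which should be fine for typical recordings. If both lists are sorted by start time, a two-pointer sweep would be faster for long files.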