PsychoinformaticsLab / pliers

Automated feature extraction in Python
https://pliers.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
296 stars 68 forks source link

Add Speaker diarization model #490

Open adelavega opened 1 year ago

adelavega commented 1 year ago

SpeechBrain looks promising for speaker recognition / diarization among other speech related features

adelavega commented 1 year ago

https://speechbrain.readthedocs.io/en/latest/index.html

adelavega commented 1 year ago

A potential: https://ufarooqi.com/speaker-diarization-for-whisper-transcripts/

adelavega commented 1 year ago

Looks like speaker diarization is not great yet, especially w/ unknown number of speakers

qmac commented 1 year ago

I can attest to the quality of of Rev.ai speaker diarization, though at the moment it only comes as a package with transcription jobs. 😄

For free/open source, I've also seen some decent results with https://github.com/pyannote/pyannote-audio compared to speechbrain

adelavega commented 1 year ago

Thanks! Actually for our purposes I really wouldn't mind just paying for Rev on occasion. Relatively small amount of data.