Open adelavega opened 1 year ago
Looks like speaker diarization is not great yet, especially w/ unknown number of speakers
I can attest to the quality of Rev.ai speaker diarization, though at the moment it only comes as a package with transcription jobs. 😄
For free/open source, I've also seen some decent results with https://github.com/pyannote/pyannote-audio compared to SpeechBrain.
Thanks! Actually for our purposes I really wouldn't mind just paying for Rev on occasion. Relatively small amount of data.
SpeechBrain looks promising for speaker recognition / diarization among other speech related features