lowerquality / gentle

gentle forced aligner
https://lowerquality.com/gentle/
MIT License

Can you identify speakers with gentle? #281

Open srijanb97 opened 4 years ago

srijanb97 commented 4 years ago

The text alignment tool works very well and is quite accurate. However, I have audio files with more than one speaker. Is there any way to detect which speaker is uttering each word?

effdotsh commented 3 years ago

That's not really what gentle is made for, but there are other tools you can use.

zxul767 commented 1 year ago

@srijanb97 look into speaker "diarization". NeMo seems to be pretty good, but there are others (e.g., pyannote).
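
For anyone landing here later, this is a rough sketch of how the two outputs could be combined, not something gentle does itself. It assumes you already have gentle's word list (entries with `word`, `case`, `start`, `end`, which is how its JSON output looks as far as I know) and a list of `(start, end, speaker)` segments from whatever diarization tool you pick. The `assign_speakers` helper, the segment tuple layout, and the speaker labels are all hypothetical; each word just gets the speaker whose segment overlaps it the most.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, segments):
    """Label each aligned word with the speaker whose segment overlaps it most.

    words:    list of dicts shaped like gentle's word entries (assumed fields:
              "word", "case", and, when aligned, "start" and "end").
    segments: list of (start, end, speaker) tuples. This layout is an
              assumption; adapt it to your diarization tool's output.
    """
    labeled = []
    for w in words:
        # Words gentle could not align carry no timestamps, so no speaker.
        if w.get("case") != "success":
            labeled.append({**w, "speaker": None})
            continue
        best_speaker, best_overlap = None, 0.0
        for seg_start, seg_end, speaker in segments:
            ov = overlap(w["start"], w["end"], seg_start, seg_end)
            if ov > best_overlap:
                best_speaker, best_overlap = speaker, ov
        labeled.append({**w, "speaker": best_speaker})
    return labeled


# Made-up example data, for illustration only.
words = [
    {"word": "hello", "case": "success", "start": 0.10, "end": 0.45},
    {"word": "there", "case": "success", "start": 0.50, "end": 0.90},
    {"word": "um", "case": "not-found-in-audio"},
    {"word": "hi", "case": "success", "start": 1.20, "end": 1.40},
]
segments = [(0.0, 1.0, "SPEAKER_00"), (1.0, 2.0, "SPEAKER_01")]

labeled = assign_speakers(words, segments)
speakers = [w["speaker"] for w in labeled]
```

Words that fall entirely in a gap between segments also end up with `speaker` set to `None`, so you may want to fall back to the nearest segment in that case.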