pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
http://pyannote.github.io
MIT License
6.2k stars 766 forks source link

Overlap in the audio #1543

Closed YashThakran closed 4 months ago

YashThakran commented 11 months ago

Hi, I have been using pyaanote for speaker diarization, and the results seem to be mostly good. However, I have observed some issues when there is an overlap of multiple speakers in the audio, pyannote labels it as any of the single speakers.

Is there any way to fix this problem or improve performance in case of overlaps?

I have tried embedding_exclude_overlap param but it doesn't help.

github-actions[bot] commented 11 months ago

Thank you for your issue. We found the following entries in the FAQ which you may find helpful:

Feel free to close this issue if you found an answer in the FAQ.

If your issue is a feature request, please read this first and update your request accordingly, if needed.

If your issue is a bug report, please provide a minimum reproducible example as a link to a self-contained Google Colab notebook containing everthing needed to reproduce the bug:

Providing an MRE will increase your chance of getting an answer from the community (either maintainers or other power users).

Companies relying on pyannote.audio in production may contact me via email regarding:

This is an automated reply, generated by FAQtory

hbredin commented 11 months ago

Your best guess is to finetune the segmentation model on your own data. See this tutorial.

Also, I'd be happy to have a look at a short excerpt of your file where this does not work as expected.

stale[bot] commented 5 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.