How to map the transcribed text with their respective speakers in speaker diarization?

ThiruRJST commented 6 months ago

Tested versions

Reproducible in 3.1

System information

Ubuntu-20.04

Issue description

I installed PyAnnote-3.1 and used speaker-diarization-3.1 for my project purpose. When I ran the pipeline, it gave me a rttm file as an output but the speakers were not mapped with their respective texts transcribed from it. Is there any workaround to achieve this?

Minimal reproduction example (MRE)

https://colab.research.google.com/github/pyannote/pyannote-audio/blob/develop/tutorials/MRE_template.ipynb

hbredin commented 6 months ago

pyannote does not do speech-to-text (what was said?). It only does speaker diarization (who spoke when?).

ThiruRJST commented 5 months ago

@hbredin Thank you for the confirmation

pyannote / pyannote-audio