MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
BSD 2-Clause "Simplified" License
2.44k stars 238 forks source link

Issue with an audio/video file #156

Open dchapelet opened 5 months ago

dchapelet commented 5 months ago

Hi, From a video source, the speakers are always the same (Speaker = 0). From an audio file only, the diarization works very well. Should we always separate audio from a video file before diarization? Thank you very much for your answer. David.

MahmoudAshraf97 commented 2 months ago

Hi, can you upload a file to reproduce the issue?