MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
BSD 2-Clause "Simplified" License
3.45k stars 291 forks source link

whisperx #133

Closed vladgrand2 closed 10 months ago

vladgrand2 commented 10 months ago

When I use whisperx transcribation I have ideal transcribation but diarization very bad. When I use diarize.py I got excellent diarization but very bad transcribe. But in both case used whisperx transcribe model. Maybe problem with aligment which realized throw deepmultilingualpunctuation?

It is possible to use whisperx transcribe and HF allign and then just diarize with Nemo without deepmultilingualpunctuation?