MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
BSD 2-Clause "Simplified" License
3.53k stars, 301 forks

Diarization does not work for Russian? #50

Closed. Blagochevsky closed this issue 1 year ago

Blagochevsky commented 1 year ago

I run diarization on an M1 Mac like this:

python3 diarize.py -a trimmed_1min.wav --no-stem --whisper-model large-v2 --device cpu

I get this error:

File "/Users/alexanderblagochevsky/Documents/whisper-diarization/diarize.py", line 160, in
    f'Punctuation restoration is not available for {whisper_results["language"]} language.'
TypeError: list indices must be integers or slices, not str

I've tried commenting out the punctuation part of diarize.py. The script then completes without error, but the output attributes the whole conversation to only one speaker.
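For readers hitting the same TypeError: the message means that `whisper_results` was a list at that point, while the f-string indexed it with the string key "language" as if it were a dict. The sketch below only reproduces that failure mode and shows one defensive way to read a language code. It is not the fix that was applied in the repository. The helper `get_language` and the sample segment data are hypothetical, and the real structure of `whisper_results` in diarize.py is an assumption here.

```python
def get_language(whisper_results, default=None):
    """Hypothetical helper: read a language code from a dict or a list of dicts."""
    if isinstance(whisper_results, dict):
        # Dict-shaped result: the language is stored under a top-level key.
        return whisper_results.get("language", default)
    if isinstance(whisper_results, list):
        # List-shaped result: look for the first item that carries a language.
        for segment in whisper_results:
            if isinstance(segment, dict) and "language" in segment:
                return segment["language"]
    return default


# Assumed sample data: a list of segments with no "language" field.
segments = [{"text": "привет", "start": 0.0, "end": 1.2}]

# Reproduce the reported failure: indexing a list with a string key.
try:
    segments["language"]
except TypeError as exc:
    error_message = str(exc)

print(error_message)
print(get_language({"language": "ru", "segments": segments}))
print(get_language(segments, default="unknown"))
```

Note that this covers only the exception. The single-speaker output that appears after commenting out the punctuation step is a separate symptom and is not explained by this sketch.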

MahmoudAshraf97 commented 1 year ago

Fixed, thanks.