Open Maldoror1900 opened 6 months ago
Good idea. I'll add support for this in the next release. Sorry for the delay in responding to you.
Yes! +1 on this! Just pass it through. Any news on this? I could offer a PR as well.
@Vaibhavs10 Any news on this?
I have opened a PR to allow that: https://github.com/Vaibhavs10/insanely-fast-whisper/pull/180
I use this line of code to transcribe and diarize at the same time:
but I get more speakers than there are in the audio. In `pyannote`, the parameter that handles this is `num_speakers=2`: `diarization = pipeline("audio.wav", num_speakers=2)`, or `diarization = pipeline("audio.wav", min_speakers=2, max_speakers=5)`.
Is there a way to set this parameter in that line, using `insanely-fast-whisper`?
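For anyone who needs this before it lands in the CLI, here is a minimal sketch of the "just pass it through" idea, calling pyannote directly. It is not the code from the PR. The helper names `diarization_kwargs` and `diarize` are hypothetical, and the model name `pyannote/speaker-diarization-3.1` and the `use_auth_token` argument are assumptions based on pyannote.audio 3.x; adjust them to your installed version.

```python
from typing import Optional


def diarization_kwargs(
    num_speakers: Optional[int] = None,
    min_speakers: Optional[int] = None,
    max_speakers: Optional[int] = None,
) -> dict:
    """Build the speaker-count keyword arguments for a pyannote pipeline call.

    Hypothetical helper: it only validates the values and drops unset ones.
    """
    # An exact count and a range are treated as mutually exclusive here.
    if num_speakers is not None and (min_speakers is not None or max_speakers is not None):
        raise ValueError("use either num_speakers or min_speakers/max_speakers, not both")

    candidates = {
        "num_speakers": num_speakers,
        "min_speakers": min_speakers,
        "max_speakers": max_speakers,
    }
    for name, value in candidates.items():
        if value is not None and value < 1:
            raise ValueError(f"{name} must be at least 1")

    if min_speakers is not None and max_speakers is not None and min_speakers > max_speakers:
        raise ValueError("min_speakers cannot be greater than max_speakers")

    # Only forward the arguments the caller actually set.
    return {name: value for name, value in candidates.items() if value is not None}


def diarize(audio_path: str, hf_token: str, **speaker_args):
    """Run pyannote diarization with the speaker hints passed through."""
    # Imported lazily so the helper above works without pyannote installed.
    from pyannote.audio import Pipeline

    # Assumption: model name and auth argument as used with pyannote.audio 3.x.
    pipeline = Pipeline.from_pretrained(
        "pyannote/speaker-diarization-3.1", use_auth_token=hf_token
    )
    return pipeline(audio_path, **diarization_kwargs(**speaker_args))
```

With this, `diarize("audio.wav", hf_token, num_speakers=2)` ends up as the same `pipeline("audio.wav", num_speakers=2)` call quoted above, and `min_speakers=2, max_speakers=5` is forwarded the same way.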