Open sh1man opened 1 month ago
There is no hard limit; it depends on the RAM and VRAM you have available.
Why is Silero VAD not used in this project?
VAD is used in transcription and diarization. If you are asking why not Silero specifically: it would be difficult to replace the built-in VAD in these modules, and there is no incentive to try.
Whisper VAD integration https://github.com/ANonEntity/WhisperWithVAD/blob/main/WhisperWithVAD.ipynb
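For anyone wondering what that kind of integration involves, here is a rough sketch of one step: grouping VAD speech segments into chunks before transcription. It is not code from this project or from the linked notebook. It assumes the segments are a list of dicts with `start` and `end` in samples (the shape Silero's `get_speech_timestamps` returns), and the function name `merge_speech_segments`, the 30-second cap, and the 16 kHz sample rate are illustrative choices.

```python
from typing import Dict, List


def merge_speech_segments(
    segments: List[Dict[str, int]],
    max_chunk_s: float = 30.0,   # assumed cap, not a project setting
    sample_rate: int = 16000,    # assumed sample rate
) -> List[Dict[str, int]]:
    """Greedily group consecutive speech segments into chunks.

    A segment is appended to the current chunk while the chunk's total
    span stays within max_chunk_s; otherwise a new chunk is started.
    A single segment longer than the cap is kept as its own chunk.
    """
    max_len = int(max_chunk_s * sample_rate)
    chunks: List[Dict[str, int]] = []
    for seg in segments:
        start, end = seg["start"], seg["end"]
        if chunks and end - chunks[-1]["start"] <= max_len:
            # Still fits: extend the current chunk to cover this segment.
            chunks[-1]["end"] = end
        else:
            # Too long (or first segment): start a new chunk.
            chunks.append({"start": start, "end": end})
    return chunks


# Hypothetical timestamps, in samples at 16 kHz.
example = [
    {"start": 0, "end": 16000},
    {"start": 32000, "end": 48000},
    {"start": 640000, "end": 720000},
]
merged = merge_speech_segments(example)
```

Each resulting chunk could then be sliced out of the waveform and passed to the transcriber separately, which also keeps memory use bounded on long files.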
In the whisperX repository I saw that they want to "Allow silero-vad as alternative VAD option".
What is the maximum length, in seconds, for a file?