Hello,
I found that there is a description about VAD filter usage in README.md that may be inconsistent with the source code. I think "removes silence longer than 2 seconds" should probably use the argument min_speech_duration_ms rather than min_silence_duration_ms according to the source code.
In README.md :
VAD filter
...
The default behavior is conservative and only removes silence longer than 2 seconds. See the available VAD parameters and default values in the source code. They can be customized with the dictionary argument vad_parameters:
min_speech_duration_ms: Final speech chunks shorter min_speech_duration_ms are thrown out.
min_silence_duration_ms: In the end of each speech chunk wait for min_silence_duration_ms before separating it
min_speech_duration_ms: int (default - 250 milliseconds)
Final speech chunks shorter min_speech_duration_ms are thrown out
min_silence_duration_ms: int (default - 100 milliseconds)
In the end of each speech chunk wait for min_silence_duration_ms before separating it
Please let me know if this understanding is correct, looking forward to a reply, thanks~
Hello, I found that there is a description about VAD filter usage in README.md that may be inconsistent with the source code. I think "removes silence longer than 2 seconds" should probably use the argument
min_speech_duration_ms
rather thanmin_silence_duration_ms
according to the source code.In README.md :
I referred to the "Attributes" in vad.py :
and utils_vad.py in snakers4/silero-vad :
Please let me know if this understanding is correct, looking forward to a reply, thanks~