Closed s1lviu closed 1 week ago
Do you have the probability number?
Around 65-70%.
Same here: a ringback tone is detected as voice with 94% probability.
The new VAD version was released just now - https://github.com/snakers4/silero-vad/issues/2#issuecomment-2195433115
It was designed with this issue in mind and performance on noise-only data was significantly improved - https://github.com/snakers4/silero-vad/wiki/Quality-Metrics
Can you please re-run your tests and, if the issue persists, open a new issue?
The Silero Voice Activity Detection (VAD) model incorrectly classifies the sound of blowing directly into the microphone as human speech. This issue persists across various blowing intensities, from soft to strong. The expected behavior is for the VAD to recognize these non-speech, noise-like sounds as non-voice activity.
Reproduction Steps
Web Application Test:
C++ Example Test:
Expected Behavior
The VAD should classify the blowing sounds as non-voice activity, distinguishing them from human speech.
Actual Behavior
The VAD classifies the blowing sounds as voice activity, which is a false positive in voice detection.
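When re-testing, it may help to report more than a single probability. The sketch below is one possible way to summarise a noise-only recording: given per-frame speech probabilities, it computes the fraction of frames that would be flagged as speech at a few thresholds. It does not call the VAD itself. The function name `false_positive_rate`, the threshold values, and the sample probabilities are all hypothetical and only illustrate the calculation; real values would come from running the model on your own blowing or ringback recording.

```python
from typing import Sequence


def false_positive_rate(speech_probs: Sequence[float], threshold: float = 0.5) -> float:
    """Fraction of frames in a noise-only clip whose speech probability meets the threshold.

    Every frame is assumed to contain no speech, so each flagged frame
    counts as a false positive.
    """
    if not speech_probs:
        raise ValueError("speech_probs must contain at least one frame")
    flagged = sum(1 for p in speech_probs if p >= threshold)
    return flagged / len(speech_probs)


# Hypothetical per-frame probabilities for a blowing-noise clip.
# These are illustrative values, not measurements.
blowing_probs = [0.65, 0.70, 0.68, 0.30, 0.94, 0.66]

# The thresholds are assumptions chosen for illustration.
for threshold in (0.5, 0.75, 0.95):
    rate = false_positive_rate(blowing_probs, threshold)
    print(f"threshold={threshold:.2f} -> {rate:.0%} of noise frames flagged as speech")
```

Reporting the rate at several thresholds shows whether raising the threshold alone would suppress the false positive, or whether the model is confidently wrong on this kind of noise.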