snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector
MIT License

Bug report — Incorrectly Detects Blowing into Mic as Voice #430

Closed (s1lviu closed 1 week ago)

s1lviu commented 3 months ago

The Silero Voice Activity Detection (VAD) model incorrectly classifies the sound of blowing directly into the microphone as human speech. This issue persists across various blowing intensities, from soft to strong. The expected behavior is for the VAD to recognize these non-speech, noise-like sounds as non-voice activity.

Reproduction Steps

Web Application Test:

C++ Example Test:

Expected Behavior

The VAD should classify the blowing sounds as non-voice activity, distinguishing them from human speech.

Actual Behavior

The VAD system classifies the blowing sounds as voice activity, indicating a false positive in voice detection.

DanyPell commented 3 months ago

Do you have the probability number?

s1lviu commented 3 months ago

> Do you have the probability number?

Around 65-70%.

lionsheep0724 commented 3 months ago

Same here. A ringback tone is detected as voice with 94% probability.
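To make the numbers reported in this thread concrete (65-70% for blowing, 94% for the ringback tone), here is a minimal sketch of how a fixed decision threshold treats them. It does not call the Silero VAD model. The helper names `is_speech` and `false_positives`, the `>=` comparison, and the candidate thresholds (including 0.5 as a starting point) are assumptions for illustration only; check the threshold your own integration actually uses.

```python
from typing import Dict, List

# Probabilities reported in this thread. The blowing case was given as a
# range (65-70%), so both ends are listed.
REPORTED: Dict[str, float] = {
    "blowing (low end)": 0.65,
    "blowing (high end)": 0.70,
    "ringback tone": 0.94,
}


def is_speech(prob: float, threshold: float) -> bool:
    """Hypothetical decision rule: speech when prob meets the threshold."""
    return prob >= threshold


def false_positives(threshold: float) -> List[str]:
    """Names of the reported non-speech sounds that would pass as speech."""
    return [name for name, p in REPORTED.items() if is_speech(p, threshold)]


if __name__ == "__main__":
    # Candidate thresholds are illustrative, not recommendations.
    for t in (0.5, 0.8, 0.95):
        print(f"threshold={t}: {false_positives(t)}")
```

Under these assumptions, raising the threshold to 0.8 would reject the blowing sounds but still accept the ringback tone, and a threshold high enough to reject a 94% score would likely also drop real speech. Threshold tuning alone therefore does not resolve the reports above.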

snakers4 commented 1 week ago

The new VAD version was released just now - https://github.com/snakers4/silero-vad/issues/2#issuecomment-2195433115

It was designed with this issue in mind and performance on noise-only data was significantly improved - https://github.com/snakers4/silero-vad/wiki/Quality-Metrics

Can you please re-run your tests and, if the issue persists, open a new issue?