In faster-whisper there's a built-in VAD feature. It is disabled by default (vad_filter defaults to false).
This feature should help with what I believe are called "hallucinations": the speaker goes silent for some time, then picks up again, but the resulting text just repeats the last sentence from before the silence.
Can a parameter be added to the REST API to optionally set this vad_filter feature to true and add vad_parameters?
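To make the request concrete, here is a rough sketch of how the two optional fields could be turned into keyword arguments for faster-whisper's transcribe() call. The helper name build_transcribe_kwargs and the idea of passing vad_parameters as a JSON string are my own assumptions for illustration, not anything that exists in this project today; vad_filter, vad_parameters and min_silence_duration_ms are the names faster-whisper itself uses.

```python
import json
from typing import Any, Dict, Optional


def build_transcribe_kwargs(
    vad_filter: bool = False,
    vad_parameters: Optional[str] = None,
) -> Dict[str, Any]:
    """Turn optional request fields into keyword arguments for transcribe().

    Hypothetical helper: vad_parameters is assumed to arrive as a JSON
    object encoded in a string (for example from a query string or form field).
    """
    kwargs: Dict[str, Any] = {"vad_filter": vad_filter}
    # Only forward vad_parameters when VAD is actually switched on.
    if vad_filter and vad_parameters:
        parsed = json.loads(vad_parameters)
        if not isinstance(parsed, dict):
            raise ValueError("vad_parameters must be a JSON object")
        kwargs["vad_parameters"] = parsed
    return kwargs


# Example: values as they might arrive in a request.
kwargs = build_transcribe_kwargs(True, '{"min_silence_duration_ms": 500}')
print(kwargs)
# The endpoint could then forward them roughly like this
# (model and audio_path are placeholders):
#   segments, info = model.transcribe(audio_path, **kwargs)
```

With both fields left out, the helper returns only vad_filter=False, so existing clients would see no change in behavior.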