-
I want to run your tts model, I compiler sherpa-onnx follow the steps:
```git clone https://github.com/k2-fsa/sherpa-onnx
cd sherpa-onnx
mkdir build
cd build
cmake -DCMAKE_BUILD_TYPE=Release ..
…
-
## 🐛 Bug
Trying to run the language (99% accuracy, 95 languages version) sample code provide in https://github.com/snakers4/silero-vad/wiki/Other-Models throws RuntimeError "stft requires the retur…
-
While the VAD (especially the micro one) was explicitly designed for IOT / edge / mobile use cases, we do not have the resource or expertise to provide instructions for corresponding ARM / mobile buil…
-
Thank you for releasing the code
since this implementation require less memory than other implementation
adding VAD (Voice activity detection) should be more suitable
Voice activity detection mak…
-
Wonderful pipeline, thanks a lot for your great work! It was a bit painful to set up in my case but now it works like a charm. Still I am curious about the usage of silenceremove and VAD together. Sho…
-
Hello all. I am working on a project pertaining to ASR in phone calls. After being dissatisfied with some of the commercial options, I wanted to try this. Is there a built in way to know when the othe…
-
For example, with a silent segment from 00:01:00 to 00:01:30, after being processed by VAD, the subtitle following appears at 00:01:00, causing the entire 30s silence period to have a subtitle. How ca…
-
## ❓Help - The processed file sounds correctly after the VAD, but the model still seems to know where the big silence gap was and stops at this place.
Hi, maybe someone faced with such a situation:…
-
## 🐛 Bug
## To Reproduce
Steps to reproduce the behavior:
import torch
model = torch.load('silero_vad.jit')
-
## 🐛 Bug
The model seems to consistently predicts confidence of above 90% on samples with no speech.
Edit: Testing again it appears that it does predict 0% on true silence, but any background noi…