-
This is the code:
import io
import numpy as np
import torch
torch.set_num_threads(1)
import torchaudio
import matplotlib
import matplotlib.pylab as plt
torchaudio.set_audio_backend("soundfil…
-
## 🐛 Bug
Loading the model with hub.load fails
## To Reproduce
```python
base $~ python3
Python 3.8.5 (default, Sep 4 2020, 07:30:14)
[GCC 7.3.0] :: Anaconda, Inc. on linux
Type "help", "c…
-
## 🐛 Bug
I would like to use CUDA to compute the VAD. Your toolkit has an argument for it: https://github.com/snakers4/silero-vad/blob/a345715b8fc2d24b2991ec5d54c7588c64b9f9c7/utils_vad.py#L174
B…
-
## ❓ Questions and Help
Excuse me, which model does this project use for VAD? Is there an article about the project?
-
Hi, I am trying to use the Language Classifier 95 model, but the accuracy is not very good.
I have tried increasing the top_n value, but it did not help much.
I thought I can neglect most of the lang…
-
## 🐛 Bug
torch.cat called with an empty list of tensors in utils_vad.py.
```python
Process Process-13:
Traceback (most recent call last):
File "/lium/raid01_b/pchampi/lab/sidekit-for-vpc/ve…
-
Hi,
I am trying to run the speech-to-text model on both GPU and CPU for a large audio file, but I get an out-of-memory error on both.
Is there any iterable lazy dataloader that can feed the audio fil…
-
Hello, I'm trying to load the ONNX model using JS in the browser. I'm using the official example from the ONNX GitHub:
```
// create a session
const myOnnxSession …
-
I just want to classify whether people are talking in a 20 ms audio clip.
Could you give me some examples?
Thanks
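
To make the request concrete, here is a minimal sketch of a per-frame check on a single 20 ms frame. It is not the project's VAD model: it is a plain energy threshold, which only tells whether the frame is loud, not whether it contains speech. The 16 kHz sampling rate, the `frame_is_active` helper, and the -35 dBFS threshold are all assumptions chosen for illustration.

```python
import numpy as np

SAMPLE_RATE = 16000                         # assumed sampling rate in Hz
FRAME_MS = 20                               # frame length from the question
FRAME_LEN = SAMPLE_RATE * FRAME_MS // 1000  # 320 samples per 20 ms frame


def frame_is_active(frame, threshold_db=-35.0):
    """Hypothetical helper: True if the frame's RMS level exceeds the threshold.

    `frame` is expected to hold float samples scaled to [-1.0, 1.0].
    This is an energy baseline, not a speech classifier.
    """
    frame = np.asarray(frame, dtype=np.float64)
    if frame.size == 0:
        return False
    rms = np.sqrt(np.mean(frame ** 2))
    # Clamp to avoid log10(0) on digital silence.
    level_db = 20.0 * np.log10(max(rms, 1e-10))
    return bool(level_db > threshold_db)


# Usage: one silent frame and one frame holding a 200 Hz tone.
t = np.arange(FRAME_LEN) / SAMPLE_RATE
silent = np.zeros(FRAME_LEN)
tone = 0.5 * np.sin(2 * np.pi * 200 * t)
print(frame_is_active(silent), frame_is_active(tone))
```

A 20 ms window carries very little context, so a decision on a single frame is likely to be noisy whichever method is used; smoothing decisions over several consecutive frames is a common way to stabilize it.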
-
When I run this code:
`speech_timestamps = get_speech_ts_adaptive(wav, model, visualize_probs=True)`
I get this error:
![image](https://user-images.githubusercontent.com/60197702/124263777-5b3a4c00-db3c-1…