-
Hi,
We trained a Zipformer model with approximately 20k hours of Hindi audio data, containing files ranging between 2-14 seconds. The test data consists of longer audio files with extended period…
-
Hi All,
How are you?
We would like to adapt the VAD model to a new domain / case which is not handled in the current version.
Is it possible to fine-tune the current VAD? if not can you add a tunea…
-
脚本无法运行,参数是按照testing.py里面提供的原始内容进行填写的
-
我用进程方式启动AutoModel处理,16K单声道的wav音频数据,vad模型内部处理数据直接卡住不动,请大佬帮我看看进程启动下vad模型内部处理数据为什么会卡住。
funasr->utils->load_utils.py的64行 data_or_path_or_list = data_or_path_or_list.mean(0)
ps:用线程模式就能正常执行,但咱们线程模式长时间运行…
-
REQUEST:
1. add a vad_segments parameter to the .transcribe() method (and don't use internal VAD in case of external segments use)
2. add an option to disable VAD
REASON:
1. I want to use my…
-
How to export an ONNX with opset version = 13? Currently, the silero_vad.onnx is opset version = 16.
Could you tell me how to get other opset version of the ONNX model?
Thanks
-
I ran code below using WSL ubuntu in windows:
```
docker run -p 9090:9090 --runtime=nvidia --gpus all --entrypoint /bin/bash -it ghcr.io/collabora/whisperlive-tensorrt
# Build tiny.en engine
bas…
-
Hello, I am currently trying to reproduce the results of VAD_tiny version and I'm unable to do so. Here's the results am getting :
-------------- Motion Prediction --------------
EPA_car: -0.041556…
-
Приветствую.
Есть ли возможность передавать параметры VAD в приложение GRPCSTTBackground?
Хотелось бы менять параметр silence_duration_threshold в зависимости от определенного UserEvent в dialplan…
-
The readme says:
> The VAD that Google developed for the WebRTC project is reportedly one of the best available, being fast, modern and free.
However I was unable to witness any auspicious accu…