-
# Instruments
We have compared 3 easy-to-use **off-the-shelf instruments for voice activity / audio activity detection**:
- Silero-vad from here - https://github.com/snakers4/silero-vad;
- A po…
-
**Is your feature request related to a problem? Please describe.**
OpenAI just extended their API with realtime support with web sockets
https://openai.com/index/introducing-the-realtime-ap…
-
-
I'm trying to run demo_part1.ipynb with default settings on win10 with python 3.9 through venv and get this error on 7th step:
7:
reference_speaker = 'resources/example_reference.mp3'
target_se, …
-
Hi, thank you for the wonderful library.
Recently, [silero-vad v5](https://github.com/snakers4/silero-vad/releases/tag/v5.0) was released. Do you have any plans to support it in this library?
I t…
-
after updating to the latest version of pandrator, dependencies are missing
[INFO] XTTS Training: ModuleNotFoundError: No module named 'df'
-
Here I will post our benchmarks comparing these three instruments
-
Hey man! Great job with the library, been super duper helpful.
I've been running some tests with live speech, which will be my use case, and I've seen that oftentimes when the audio gets normalized…
-
Is it not possible to transcribe long audio files, around ~3 hours? I am trying to transcribe the 3-hour audio to Hindi, but it uses huge memory.
```
import torch
import nemo.collections.asr as …
-
RT,whisper 转文字的时候,标点和时间有时候有点问题,需要通过 vad 先分割语音部分后,再转写。
也能减少 whsper 的幻觉,提高转写速度。
同时可以把 srt 翻译的功能,单独列做一个小功能。