-
Running demo_part1.ipynb up to this code:
`reference_speaker = 'resources/example_reference.mp3'
target_se, audio_name = se_extractor.get_se(reference_speaker, tone_color_converter, target_dir='processed',…
-
When I use OpenAI TTS there is no problem, but when I use ElevenLabs TTS, it speaks the first message and then I get this message in the terminal: {"message": "11labs reported an error: detected_unusua…
-
I am getting the error below when compiling C++ code.
/usr/local/include/c++/12.2.0/bits/stl_construct.h:119:7: error: no matching function for call to 'Ort::Session::Session(Ort::Env&, const wchar_…
-
### Feature request
New feature using VAD for silence suppression. A better description can be found at https://github.com/jianfch/stable-ts?tab=readme-ov-file#silence-suppression
### Motivation…
-
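As the title says: when Whisper transcribes speech to text, the punctuation and timestamps are sometimes off, so the audio should first be split into speech segments with VAD and then transcribed.
This would also reduce Whisper's hallucinations and speed up transcription.
In addition, the SRT translation function could be split out as a small standalone feature.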
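
To illustrate the "segment first, then transcribe" idea in the request above, here is a minimal sketch. It uses a simple frame-energy threshold as a stand-in for a real VAD model, so it is not how this project or any particular VAD library works. The function name `detect_speech_segments` and all of its parameters are hypothetical, chosen only for this example.

```python
from typing import List, Sequence, Tuple


def detect_speech_segments(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: int = 30,
    energy_threshold: float = 0.01,
    min_silence_frames: int = 3,
) -> List[Tuple[float, float]]:
    """Return (start_sec, end_sec) spans whose mean frame energy reaches the threshold.

    Hypothetical helper: an energy threshold is a crude substitute for a trained VAD.
    """
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    n_frames = len(samples) // frame_len
    segments: List[Tuple[float, float]] = []
    start = None      # index of the first frame of the current speech span
    silence_run = 0   # consecutive silent frames seen inside a span

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy >= energy_threshold:
            if start is None:
                start = i
            silence_run = 0
        elif start is not None:
            silence_run += 1
            if silence_run >= min_silence_frames:
                end = i - silence_run + 1  # first silent frame after the speech
                segments.append((start * frame_len / sample_rate,
                                 end * frame_len / sample_rate))
                start = None
                silence_run = 0

    if start is not None:
        # Audio ended while still inside a span; drop any trailing silent frames.
        end = n_frames - silence_run
        segments.append((start * frame_len / sample_rate,
                         end * frame_len / sample_rate))
    return segments
```

Each returned span could then be cut out and passed to the transcriber on its own, with the span's start time added back to the timestamps it produces. Silent stretches are never transcribed, which is where the hoped-for reduction in hallucinations and runtime would come from.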
-
The info in #21 came out of me playing quite a bit with trying to make a performant (in iOS Safari) client-side VAD.
I ended up creating a whole new package to get it going: https://github.com/mkcode/…
-
Code:
```
from RealtimeSTT import AudioToTextRecorder
import ssl
ssl._create_default_https_context = ssl._create_unverified_context
import torch
model, _ = torch.hub.load(repo_or_dir="snakers4…
-
### What features would you like to see added?
Enhance the current AI chat interface by adding a voice call functionality. This feature will allow users to initiate and engage in voice conversations …
-
Just a handy issue to be notified of the latest changes and micro-releases (we will mostly be changing the models).
-
Hi, thank you for the wonderful library.
Recently, [silero-vad v5](https://github.com/snakers4/silero-vad/releases/tag/v5.0) was released. Do you have any plans to support it in this library?
I t…