-
### Description
We are currently facing issue with speed of inference of tts model. we have two tts models trained mms tts and speech t5.
Possible Solutions we should look into.
1. Quantization of s…
-
To Reproduce
1.docker pull
registry.cn-hangzhou.aliyuncs.com/funasr_repo/funasr:funasr-runtime-sdk-gpu-0.2.0
mkdir -p ./funasr-runtime-resources/models
sudo docker run --gpus=all --net=host -it --…
-
**Issue Description**:
There are some strange interactions with being **Dead but Responsive and Asleep at the same time (DRA)** when the AI speaks via a holopad in the room you're in.
If the A…
-
I want to use wavtokenizer to speech AI. Is avtokenizer apply streaming infer?
-
GLM-4-Voice is an end-to-end voice model launched by Zhipu AI. GLM-4-Voice can directly understand and generate Chinese and English speech, engage in real-time voice conversations, and change attribut…
-
现在已经有不少开源的 AI TTS, 比如 f5-tts/fish-speech/gpt-sovits 等。
这些 TTS 虽然合成速度缓慢,但是作为听书或者文本转音频使用还是不错的。
这些 TTS 一般都提供了通过 http 方式调用的接口,但是没有直接使用 SAPI5 来的方便。
所以我想,您是否有兴趣开发一款 AI TTS 的 SAPI5 适配器呢?
-
Is it possible to have a native support for Bark TTS or langchain version of it?
-
Hi!
I’m really excited about this project! I have a similar one that uses JavaScript.
I would like to include an option for Speech-to-Text (STT). I’ve found that the Facebook model provides bett…
-
Bug:
2024-11-13 23:59:20.548 | ERROR | pipecat.services.azure:_handle_canceled:325 - Speech synthesis canceled: CancellationReason.Error
Got this error while using Azure TTS and Twilio(Fastapi…
-
# apipie_ai
## URLs
- https://apipie.ai/docs/api/introduction
## Actions
### chat
#### Prompt
Query LLM using text and, for vision-capable models, image data. Use reloadProps to change betwe…