-
### Self Checks
- [X] This template is only for bug reports. For questions, please visit [Discussions](https://github.com/fishaudio/fish-speech/discussions).
- [X] I have thoroughly reviewed the proj…
-
Similar to what I suggested in #1478, it would be great if text-to-speech had a set of common types. The `SpeechModel` interface, as well as `SpeechPrompt`, `SpeechResponse`, and `StreamingSpeechModel…
-
When using LiveKit agents, sometimes the agent hears its own TTS output (eg via the laptop speakers) which is then interpreted as speech from the user.
This then creates a feedback loop where the a…
-
https://github.com/OpenMOSS/AnyGPT/blame/6404dbafccc10943be6bf6e24a4b99b3a6545501/anygpt/src/m_utils/prompter.py#L45
Hello,
Is this line correct? Is this for speech-to-speech conversation?
In tha…
-
### 🚀 The feature, motivation and pitch
As we all know, GPT-4o is an end2end multi-modal models, which support Speech to Text/Speech. I have some ideas about it:
1. Speech to Text: Can we have a t…
-
参考 [huggingface上的代码](https://huggingface.co/funasr/campplus#voice-activity-detection-streaming),会存在一个问题,即当前的chunk进到 vad 模型中,得到了 value 的结果,比如说拿到了 start 的结果,但是这个 start 的时间点是位于前 3 到 4 个 chunks 里面的。请问有什么方…
-
Speech rate setting in tray and via keyboard shortcuts would be great in addition to settings.conf file.
-
### Bug ID: 16
**Date:** 2024-10-14 00:00:00
**Severity:** Minor
**Title:** The Text appears separately from the bubble speech in "Dogecoin" Stories when viewing the story.
#### Precondition:
1…
-
Hi. I followed the set up in the readme and run `bash omni_speech/infer/run.sh omni_speech/infer/examples` but encounter this error
```
Traceback (most recent call last):
File "/home/rczheng/LLaM…
-
## What's the problem?
Creators who rely on their voice for input cannot easily create Wordplay projects.
## What's the design idea?
Partner with speech-reliant teachers and students to co-de…