speech Search Results - Githubissues

1000+ results
for speech

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

fishaudio/fish-speech #671

while training full or warnings and errors, the weights are …

### Self Checks - [X] This template is only for bug reports. For questions, please visit [Discussions](https://github.com/fishaudio/fish-speech/discussions). - [X] I have thoroughly reviewed the proj…

padmanabanSampath updated 1 week ago
1
spring-projects/spring-ai #1496

Common types for text-to-speech

Similar to what I suggested in #1478, it would be great if text-to-speech had a set of common types. The `SpeechModel` interface, as well as `SpeechPrompt`, `SpeechResponse`, and `StreamingSpeechModel…

habuma updated 3 weeks ago
2
livekit/agents #315

Agent speech output audio is interpreted as user speech

When using LiveKit agents, sometimes the agent hears its own TTS output (eg via the laptop speakers) which is then interpreted as speech from the user. This then creates a feedback loop where the a…

andrewjhogue updated 2 weeks ago
4
OpenMOSS/AnyGPT #32

Speech-to-Speech task prompt

https://github.com/OpenMOSS/AnyGPT/blame/6404dbafccc10943be6bf6e24a4b99b3a6545501/anygpt/src/m_utils/prompter.py#L45 Hello, Is this line correct? Is this for speech-to-speech conversation? In tha…

ehosseiniasl updated 3 months ago
6
X-LANCE/SLAM-LLM #78

Do you have any plan about Speech to Text or Speech to Speec…

### 🚀 The feature, motivation and pitch As we all know, GPT-4o is an end2end multi-modal models, which support Speech to Text/Speech. I have some ideas about it: 1. Speech to Text: Can we have a t…

Irvingao updated 2 weeks ago
7
modelscope/3D-Speaker #155

VAD 流式推理的问题

参考 [huggingface上的代码](https://huggingface.co/funasr/campplus#voice-activity-detection-streaming)，会存在一个问题，即当前的chunk进到 vad 模型中，得到了 value 的结果，比如说拿到了 start 的结果，但是这个 start 的时间点是位于前 3 到 4 个 chunks 里面的。请问有什么方…

TungyuYoung updated 18 hours ago
1
jame25/Piper-Tray #4

[Enhancement] Speech rate

Speech rate setting in tray and via keyboard shortcuts would be great in addition to settings.conf file.

agiz10 updated 3 weeks ago
3
wwwser11/ComiCoin-Mobile-QA #16

The Text appears separately from the bubble speech in "Dogec…

### Bug ID: 16 **Date:** 2024-10-14 00:00:00 **Severity:** Minor **Title:** The Text appears separately from the bubble speech in "Dogecoin" Stories when viewing the story. #### Precondition: 1…

wwwser11 updated 6 days ago
1
ictnlp/LLaMA-Omni #49

Error while inference

Hi. I followed the set up in the readme and run `bash omni_speech/infer/run.sh omni_speech/infer/examples` but encounter this error ``` Traceback (most recent call last): File "/home/rczheng/LLaM…

ZhengRachel updated 3 weeks ago
1
wordplaydev/wordplay #394

Speech input

## What's the problem? Creators who rely on their voice for input cannot easily create Wordplay projects. ## What's the design idea? Partner with speech-reliant teachers and students to co-de…

amyjko updated 1 month ago
5

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for speech

1000+ results
for speech