speech-input Search Results

1000+ results
for speech-input

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

marawanxmamdouh/ConvoNerd #19

Sub-Feature 2: Audio File Handling - Speech-to-Text Conversi…

**Parent ticket:** [Feature: Audio File Handling](https://github.com/marawanxmamdouh/ConvoNerd/issues/17) ### Description: Implement a speech-to-text module to transcribe audio content into text. …

marawanxmamdouh updated 1 week ago
7
FunAudioLLM/CosyVoice #433

why the length of the final chunk changes even when using th…

Hi, I repeated the streaming generation several times with the same input but I found that the length of the final yielded chunk changes every time. As you can see below, the yield speech len of the …

huskyachao updated 2 weeks ago
1
gpt-omni/mini-omni #5

Some common questions about our model.

Q: "I have four questions that I would like to confirm or discuss: --- Does this model have the capability for streaming TTS? I only saw streaming audio tokens mentioned, so is this Encodec (SNAC) …

superFilicos updated 1 month ago
2
CHOMPStation2/CHOMPStation2 #8792

[GENERAL] TGUI Say

- [x] Keep typed buffer when switching up. - [ ] Increase typing field size - [ ] allow to disable thinking state - [x] old inputs currently don't show typing indicator - [x] Enable the old verbs …

Kashargul updated 1 week ago
1
nchapman/nebula #16

Investigation | Handle speech as an input method

**Why** As of March 26, 2024, we know how to capture but not sure how to send the speech input results. **What** Within this task it is required to investigate how to handle the speed input approach

andriikrainii updated 5 months ago
1
Lightning-AI/LitServe #320

Websocket Support for Streaming Input and Output

---- ## 🚀 Feature Support websocket endpoints to allow two-way real-time data communication. ### Motivation Currently, the requests are processed with the expectation that the …

ChenghaoMou updated 2 days ago
4
OpenMOSS/AnyGPT #25

About input formats for training and inference

Anygpt is trained only with the Next Token Prediction task. Take text to image as an example，Is the training input speech tokens text tokens image tokens music tokens? I want to know the input…

wen020 updated 2 months ago
2
huggingface/speech-to-speech #107

Latency Optimization for Speech-to-Speech Pipeline

Hi, I am currently running the speech-to-speech pipeline on an AWS EC2 instance (Ubuntu 20.04) with an Nvidia A10g GPU. The pipeline works well, but I am experiencing around 1 second of latency, an…

yatharthk2 updated 3 weeks ago
3
OpenPecha/tts-model #2

TTS0005: Create a pipeline for MMS TTS model, and train on t…

### **Description** We are going to fine-tune Meta's **MMS (Massively Multilingual Speech)** model for a Tibetan speaker named **Sherab** using Sherab's dataset. The process includes preparing Shera…

gangagyatso4364 updated 6 days ago
12
Azure/azure-sdk-for-net #45340

Confusing naming and not working

### Type of issue Code doesn't work ### Description I've tried to set the audio output speaker using this and after running: ` var enumerator = new MMDeviceEnumerator(); …

1cuu7 updated 2 months ago
1

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for speech-input

1000+ results
for speech-input