-
**Parent ticket:** [Feature: Audio File Handling](https://github.com/marawanxmamdouh/ConvoNerd/issues/17)
### Description:
Implement a speech-to-text module to transcribe audio content into text.
…
-
Hi, I repeated the streaming generation several times with the same input but I found that the length of the final yielded chunk changes every time. As you can see below, the yield speech len of the …
-
Q: "I have four questions that I would like to confirm or discuss:
--- Does this model have the capability for streaming TTS? I only saw streaming audio tokens mentioned, so is this Encodec (SNAC) …
-
- [x] Keep typed buffer when switching up.
- [ ] Increase typing field size
- [ ] allow to disable thinking state
- [x] old inputs currently don't show typing indicator
- [x] Enable the old verbs …
-
**Why**
As of March 26, 2024, we know how to capture but not sure how to send the speech input results.
**What**
Within this task it is required to investigate how to handle the speed input approach
-
----
## 🚀 Feature
Support websocket endpoints to allow two-way real-time data communication.
### Motivation
Currently, the requests are processed with the expectation that the …
-
Anygpt is trained only with the Next Token Prediction task.
Take text to image as an example,Is the training input speech tokens text tokens image tokens music tokens?
I want to know the input…
-
Hi,
I am currently running the speech-to-speech pipeline on an AWS EC2 instance (Ubuntu 20.04) with an Nvidia A10g GPU. The pipeline works well, but I am experiencing around 1 second of latency, an…
-
### **Description**
We are going to fine-tune Meta's **MMS (Massively Multilingual Speech)** model for a Tibetan speaker named **Sherab** using Sherab's dataset. The process includes preparing Shera…
-
### Type of issue
Code doesn't work
### Description
I've tried to set the audio output speaker using this and after running:
` var enumerator = new MMDeviceEnumerator();
…
1cuu7 updated
2 months ago