-
Hi Unsloth!
I came across this interesting model on Reddit: https://www.reddit.com/r/LocalLLaMA/comments/1ez8rmu/llama31_just_got_ears_early_experiments/
It allows Text and Audio as input, and o…
-
I found it quite frustrating when trying to use the TTS combined with Azure's ASR. For some reason, the TTS output was incorrectly picked up by the ASR even when I muted the microphone with PyAudio. So pleas…
-
I have tried all the possible settings for models, sample rate, and channels, but I am not able to get recognized speech from VOSK, just empty strings. I have tried the same sample on free speech reco…
-
I am on Ubuntu with a 4090, a Ryzen 9 7950X, and 64 GB of DDR5.
I've been using the original Tortoise TTS for a while, but wanted to try your version. However, most voices don't work. For example, "ed" wo…
-
# Task Name: Text-to-Audio Generation
The task aims to generate general audio based on the given holistic text description.
## Task Objective
The primary goal of the Text-to-Audio (TTA) Gener…
-
### Describe the bug
Here I am attaching a screen recording to showcase the results. There is a flicker at the start to... today, which is weird. Let me know if more details are required. Thanks!
…
-
I'm trying to use the code inside `text_to_speech_stream.ts` to get an audio stream and upload it to AWS S3, but it gives me the error below.
```
/**
* Uses the ElevenLabs API to convert text to audio
…
-
I can only find loading functions for text, vision_data, and audio_data, but none for depth and thermal. How can I load depth and thermal data?
ModalityType.TEXT: data.load_and_transform_text(text_list, devic…
-
**Flutter Version**
My version : 3.24
**Lib Version**
My version : 3.0.0
**Platform (Android / iOS / web) + version**
Platform : Android 14
**Describe the bug**
App crashes when i…
-
The profile registry for TTML is: https://www.w3.org/TR/ttml-profile-registry/
We need to have a profile identifier for DAPT. I would suggest ... `dapt`.