-
I've managed to get a POC with voice consistency working pretty well. Along the way, I've figured out how to do _ok-ish_ zero-shot voice cloning, too. It took drawing on tidbits spread between several…
-
It's awesome to see MusicGen on MLX.
Could we have a `duration` parameter to set how long the generated audio should be?
The [original musicgen](https://github.com/facebookresearch/audiocraft/blob/ma…
-
### Model/Pipeline/Scheduler description
ConsistencyTTA, introduced in the paper [_Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation_](https://arxiv.org/abs/2309.…
-
I just generated a fairly large output (about 11k lines). On export I got the error "file audio_3557.wav is missing". I found that about 60 files were indeed missing even though the interface showed…
-
# Task Name: Audio Caption Generation
Generating natural-language descriptions for any kind of audio in the wild.
## Task Objective
The task of Audio Caption Generation involves a model receivin…
-
This is my simple test script:
```python
import torch
from parler_tts import ParlerTTSForConditionalGeneration
from transformers import AutoTokenizer
import soundfile as sf
torch_device = "m…
-
Hello, thanks for this great work! I followed the [INFERENCE](https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md) instructions, but ran into some difficulties.
```
from parler_tts i…
-
- [x] I have read and agree to the [contributing guidelines](https://github.com/griptape-ai/griptape#contributing).
Can we have an audio generation driver for SunoAI?
https:…
-
**Describe the solution you'd like**
Several options/techniques are available. I've tested a few:
1. Auto generation at transition points (beat kicks, etc.)
2. Audio Reactive Video2Video
3. Audio Rea…
-
### Describe the bug
After updating packages and external modules, I could no longer start the project. It now fails with this error:
```
* Running on local URL: http://0.0.0.0:7770
Traceback (mos…