-
I'm curious about the memory requirements for running the model. Could anyone who has run it share memory usage and any compute numbers, such as how long it took to run for your task?
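Until someone posts measured numbers, a rough lower bound can be worked out from the parameter count, since the weights alone take about (number of parameters) x (bytes per parameter). The sketch below is only that estimate: the 1-billion-parameter figure is a hypothetical placeholder, not this model's actual size, and `estimate_weight_memory_gib` is an illustrative helper, not part of any library. Activations, caches, and framework overhead come on top of this figure.

```python
# Bytes needed to store one parameter in common dtypes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def estimate_weight_memory_gib(num_params: int, dtype: str = "float32") -> float:
    """Approximate memory for the weights alone, in GiB (excludes activations)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


# Hypothetical parameter count; replace with the real one for your model.
HYPOTHETICAL_NUM_PARAMS = 1_000_000_000

for dtype in ("float32", "float16", "int8"):
    gib = estimate_weight_memory_gib(HYPOTHETICAL_NUM_PARAMS, dtype)
    print(f"{dtype}: about {gib:.2f} GiB for weights")
```

Actual peak usage depends on batch size, sequence or audio length, and the runtime, so measured numbers from real runs are still the better answer.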
-
When I use the pipeline, I get an error: KeyError: "Unknown task depth-estimation, available tasks are ['audio-classification', 'automatic-speech-recognition', 'conversational', 'feature-extrac…
-
How about adding text-to-speech alternatives to OpenAI, such as Deepgram and fish.audio? Similarly, how about adding other LLMs as well?
-
Hi, authors. Thanks for your great work. I have a question about stage1 training. It does not take audio features as input, so what is the purpose of stage1? Just think: we have the same ref i…
-
### System Info
- `transformers` version: 4.45.1
- Platform: Linux-5.10.225-213.878.amzn2.x86_64-x86_64-with-glibc2.31
- Python version: 3.11.9
- Huggingface_hub version: 0.25.1
- Safetensors ver…
-
prompt = "google is a great website to let you find your niche."
description = "A female speaker with a slightly low-pitched voice delivers her words quite expressively, in a very confined sounding e…
-
### Feature Name
Research about Stability.ai
### Feature Description
This is research about Stability.ai: learning more about its supported models, how it is used, and more.
### Motivati…
-
This seems like a great project! I am involved in several AI projects that use voice generation. Is it in your plans to add a web server (REST API / WebSockets) by any chance that could load a voice m…
-
Can you please fix it?
https://huggingface.co/spaces/haoheliu/audioldm-text-to-audio-generation
-
**Describe the bug**
Audio generated for the `gu-IN` locale using the voice `gu-IN-DhwaniNeural` contains about 3 seconds of silence at the end of the audio file. The same generation, performed using `gu-IN-NiranjanNe…