audio-generation Search Results

1000+ results
for audio-generation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/parler-tts #45

[Inference] Set do_sample=False disrupt the generation

I tried to output the same voice for consistency, so I set the do_sample=False. However, the output is basically noise. Here is my code: prompt = "It took me quite a long time to develop a voice, a…

elricwan updated 1 month ago
3
facebookresearch/ImageBind #40

The issue about Audio to Image Generation

An amazing work!!! It's well known that https://github.com/lucidrains/DALLE2-pytorch and https://github.com/LAION-AI/dalle2-laion used open-clip as pretrianed text and image encoder. However, I ha…

liu-zhy updated 1 year ago
3
livepeer/bounties #52

Stable Audio Pipeline implementation Bounty [$850]

# Overview To enhance the feature set of our [ai-network](https://docs.livepeer.org/ai/pipelines/overview#models-on-the-ai-subnet/), we aim to implement a `text-to-audio` pipeline using the [Stable…

JJassonn69 updated 1 month ago
2
microsoft/i-Code #134

Unable to reproduce the results of the paper

Hello. I tried using the demo code of Codi (https://github.com/microsoft/i-Code/tree/main/i-Code-V3) to reproduce results on the AudioCaps dataset. However, I was unable to achieve the results reporte…

XinMing0411 updated 4 days ago
2
facebookresearch/seamless_communication #242

no audio generation from unity.cpp?

I assumed unity.cpp would be on feature parity with the original engine, but looks like it only generates the translated text, and not the audio. Is this something that will be made available in th…

cocktailpeanut updated 9 months ago
1
batterseapower/pinyin-toolkit #131

More pervasive measure word audio generation

MW audio is only being generated in expression onfocuslost. We could add either: 1) A feature like we have for the audio field currently where you can type stuff into the MW Audio field and get it re…

batterseapower updated 15 years ago
1
jishengpeng/WavTokenizer #5

Quality on lower bandwidth?

Hi, this is an amazing work that combines understanding and generation into one single tokenizer. Have you guys tried lower bandwidth, e.g. less than 20 or even 15 tokens per second?

OnceJune updated 3 weeks ago
3
microsoft/RAG_Hack #160

Project: Interactive Learning Platform

### Project Name Curio ### Description ## ✨Curio Curio is a personalised learning platform which uses Retrieval-Augmented Generation (RAG) to generate interactive audio lessons that engage users i…

lilbandit updated 1 week ago
2
Macoron/whisper.unity #45

Hallucinations and VAD [BLANK_AUDIO] Generations

Tested with both small and tiny model sizes. Using the Streaming example with VAD turned on etc. I've tried different settings and tried using a prompt to try and eliminate hallucinations and sound…

atx-barnes updated 1 year ago
5
enricoros/big-AGI #482

LocalAI Integration

Tracking the individual LocalAI APIs Integration: - [x] (great) [Text generation](https://localai.io/features/text-generation/) with GPTs - [x] (good) [Function calling](https://localai.io/feature…

enricoros updated 4 months ago
2

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for audio-generation

1000+ results
for audio-generation