-
I tried to output the same voice for consistency, so I set the do_sample=False. However, the output is basically noise. Here is my code:
prompt = "It took me quite a long time to develop a voice, a…
-
An amazing work!!!
It's well known that https://github.com/lucidrains/DALLE2-pytorch and https://github.com/LAION-AI/dalle2-laion used open-clip as pretrianed text and image encoder. However, I ha…
-
# Overview
To enhance the feature set of our [ai-network](https://docs.livepeer.org/ai/pipelines/overview#models-on-the-ai-subnet/), we aim to implement a `text-to-audio` pipeline using the [Stable…
-
Hello. I tried using the demo code of Codi (https://github.com/microsoft/i-Code/tree/main/i-Code-V3) to reproduce results on the AudioCaps dataset. However, I was unable to achieve the results reporte…
-
I assumed unity.cpp would be on feature parity with the original engine, but looks like it only generates the translated text, and not the audio.
Is this something that will be made available in th…
-
MW audio is only being generated in expression onfocuslost. We could add either:
1) A feature like we have for the audio field currently where you can type stuff into the MW Audio field and get it re…
-
Hi, this is an amazing work that combines understanding and generation into one single tokenizer. Have you guys tried lower bandwidth, e.g. less than 20 or even 15 tokens per second?
-
### Project Name
Curio
### Description
## ✨Curio
Curio is a personalised learning platform which uses Retrieval-Augmented Generation (RAG) to generate interactive audio lessons that engage users i…
-
Tested with both small and tiny model sizes.
Using the Streaming example with VAD turned on etc. I've tried different settings and tried using a prompt to try and eliminate hallucinations and sound…
-
Tracking the individual LocalAI APIs Integration:
- [x] (great) [Text generation](https://localai.io/features/text-generation/) with GPTs
- [x] (good) [Function calling](https://localai.io/feature…