LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Hi, I noticed in the readme file that you said you had an issue with the stability of the gradio lib for audio streaming.
It turns out that another repository called Omni-mini contains an alternative way to use gradio that could be of interest to you. Here's the link: https://huggingface.co/spaces/gradio/omni-mini
Hi, I noticed in the readme file that you said you had an issue with the stability of the gradio lib for audio streaming.
It turns out that another repository called Omni-mini contains an alternative way to use gradio that could be of interest to you. Here's the link: https://huggingface.co/spaces/gradio/omni-mini