LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
This single script downloads both the fine-tuned Llama-3.1-8B-Omni model and the Whisper-large-v3 model, which simplifies the setup process and resolves a few ambiguities.
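As a rough illustration of what a combined download step can look like, here is a minimal sketch. It is not the script described above. It assumes the fine-tuned weights are published on Hugging Face under the repo id `ICTNLP/Llama-3.1-8B-Omni`, that the `huggingface_hub` and `openai-whisper` packages are installed, and that a `models/` directory layout is acceptable. The helper names `target_dirs` and `download_all` are hypothetical.

```python
from pathlib import Path

# Assumed identifiers; check them against the project's own instructions.
OMNI_REPO_ID = "ICTNLP/Llama-3.1-8B-Omni"  # assumed Hugging Face repo id
WHISPER_NAME = "large-v3"                  # model name used by openai-whisper


def target_dirs(root="models"):
    """Return the (assumed) local directories for each model."""
    root = Path(root)
    return {
        "omni": root / "Llama-3.1-8B-Omni",
        "whisper": root / "speech_encoder",
    }


def download_all(root="models"):
    """Download both models into the directories from target_dirs()."""
    # Imported lazily so the module can be loaded without these packages.
    from huggingface_hub import snapshot_download
    import whisper

    dirs = target_dirs(root)
    for d in dirs.values():
        d.mkdir(parents=True, exist_ok=True)

    # Fetch every file of the fine-tuned model repository.
    snapshot_download(repo_id=OMNI_REPO_ID, local_dir=str(dirs["omni"]))

    # load_model() downloads the checkpoint if it is missing, then loads it;
    # device="cpu" avoids needing a GPU just to fetch the file.
    whisper.load_model(WHISPER_NAME, device="cpu", download_root=str(dirs["whisper"]))
    return dirs
```

Calling `download_all()` once fetches both models; the directories can then be passed to whatever inference entry point the project provides. Note that `whisper.load_model` also loads the checkpoint into memory, so this step needs several gigabytes of RAM in addition to disk space.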