LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Hi there,
Good job!
Do you have a plan to release the codes for getting discrete units (using Hubert and a K-means model)?
or give a guidance on where to download the required encoder model.
Hi there, Good job! Do you have a plan to release the codes for getting discrete units (using Hubert and a K-means model)? or give a guidance on where to download the required encoder model.
thanks!