fixie-ai / ultravox

A fast multimodal LLM for real-time voice
https://ultravox.ai
MIT License
1.4k stars 88 forks source link

Dutch Language Support #139

Open benlower opened 1 month ago

matejsarlija commented 3 weeks ago

Technically for all of these language requests, one can retrain / finetune the base model. With speech-to-speech, dataset seems a bigger moat then ever.

asultanoff commented 5 days ago

Technically for all of these language requests, one can retrain / finetune the base model. With speech-to-speech, dataset seems a bigger moat then ever.

Can you please elaborate more on speech2speech part? Ultravox inferences text tokens and used tts system for speech no?