FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
https://funaudiollm.github.io/
Apache License 2.0
6.02k stars 643 forks source link

torchrun failed to run on multi nodes #241

Open zhangyike opened 3 months ago

zhangyike commented 3 months ago

I meet the following error with training llm on 2 gpu machines:

How to set torchrun options when multi gpu nodes are used?

aluminumbox commented 2 months ago

did you follow libritts example? libritts recipe is already validated, try run it first