Open jerrymatjila opened 3 days ago
Hi,
Currently, the model cannot control the speed. However, different random seeds or hyperparameters could generate speech with different speed. You could adjust sub_amount
, top_p
, seed
or cfg_coef
in inference.py
to get various results.
Is there a way to adjust output audio speed when performing zero-shot TTS?