[Feature request] Can we add the batch inference or batch decoding for XTTS

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

http://coqui.ai

Mozilla Public License 2.0

31.64k stars, 3.78k forks

[Feature request] Can we add the batch inference or batch decoding for XTTS #3776

Open Onkarsus13 opened 3 weeks ago

Onkarsus13 commented 3 weeks ago

I tried batch inference with XTTS by padding every text sequence to the longest one in the batch and passing an attention mask for the padded positions. For the shorter sequences, however, I get random noise at the end of the audio. It would be helpful to have this feature in Coqui TTS.
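For readers following along, the padding setup described above could look roughly like the sketch below. It is only an illustration in plain Python, not XTTS code: `pad_batch` and `PAD_ID` are hypothetical names, and the real padding token id and tensor types depend on the tokenizer and model in use.

```python
from typing import List, Tuple

# Assumption: 0 is used as the padding id here purely for illustration;
# the actual tokenizer may reserve a different value.
PAD_ID = 0


def pad_batch(
    sequences: List[List[int]], pad_id: int = PAD_ID
) -> Tuple[List[List[int]], List[List[int]]]:
    """Right-pad token id lists to the longest one and build a matching mask.

    The mask holds 1 for real tokens and 0 for padded positions.
    """
    max_len = max(len(seq) for seq in sequences)
    padded, mask = [], []
    for seq in sequences:
        n_pad = max_len - len(seq)
        padded.append(seq + [pad_id] * n_pad)
        mask.append([1] * len(seq) + [0] * n_pad)
    return padded, mask


# Two token sequences of different lengths (made-up ids).
batch = [[5, 6, 7, 8], [9, 10]]
padded, mask = pad_batch(batch)
print(padded)
print(mask)
```

The mask only marks which input positions are padding. It does not by itself tell the decoder where each item's output should end, so the shorter items in a batch may still need their generated audio handled separately.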

tuanh123789 commented 4 days ago

I face the same problem when running batched inference. Did you solve it?