rsxdalv / tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
https://rsxdalv.github.io/tts-generation-webui/
MIT License

Stereo WAV files do not work #374

Open Consequence6 opened 1 week ago

When a stereo audio file is used in the Maha-TTS tab, generation fails with: RuntimeError: shape '[1, 1, 484463]' is invalid for input of size 968926

Converting the file to mono fixes it.
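
The input size in the error is exactly twice the expected length (968926 = 2 × 484463), which is consistent with both stereo channels being passed where a single channel is expected. As a workaround, the mono conversion can be sketched with the Python standard library alone. This is a minimal sketch, not the project's own code: the function name stereo_to_mono and the file names are hypothetical, and it assumes an uncompressed 16-bit PCM WAV file.

```python
import array
import sys
import wave


def stereo_to_mono(src_path: str, dst_path: str) -> None:
    """Downmix a 16-bit PCM WAV file to mono by averaging its channels.

    Hypothetical helper for illustration; assumes uncompressed 16-bit PCM.
    """
    with wave.open(src_path, "rb") as src:
        n_channels = src.getnchannels()
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM")

    samples = array.array("h")
    samples.frombytes(frames)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian

    # Samples are interleaved (L, R, L, R, ...); average each frame's channels.
    mono = array.array(
        "h",
        (
            sum(samples[i : i + n_channels]) // n_channels
            for i in range(0, len(samples), n_channels)
        ),
    )
    if sys.byteorder == "big":
        mono.byteswap()  # convert back to little-endian before writing

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(frame_rate)
        dst.writeframes(mono.tobytes())
```

For example, stereo_to_mono("voice_stereo.wav", "voice_mono.wav") writes a single-channel copy that can then be loaded in the Maha-TTS tab. Averaging the channels keeps the result within the 16-bit range; files in other sample formats would need a different tool or a more general reader.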