rsxdalv / tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
https://rsxdalv.github.io/tts-generation-webui/
MIT License
1.81k stars 197 forks source link

Bug report: Application exits when trying to clone voice in Bark Voice Clone #416

Open Aamir3d opened 1 day ago

Aamir3d commented 1 day ago

Steps

  1. Start Webui
  2. Switch to BARK voice clone tab
  3. Add an MP3 file (via upload)
  4. Click Generate image

Expected result - new voice file created from MP3 Actual result - Application exits with no error messages image

rsxdalv commented 1 day ago

I will test it out. What about non mp3, does that work?

rsxdalv commented 1 day ago

I was able to do it, so there's a more complicated issue. MP3 does not seem to be the issue either. I can suggest two methods: trying it again without GPU, or trying it after the computer has been restarted.

Generally if it crashes without any info that's a high-level crash, like PC running out of memory.

Aamir3d commented 1 day ago

I was able to do it, so there's a more complicated issue. MP3 does not seem to be the issue either. I can suggest two methods: trying it again without GPU, or trying it after the computer has been restarted.

Generally if it crashes without any info that's a high-level crash, like PC running out of memory.

Thanks - I'm not out of memory (12 GB GPU, but it shows 50% free when the model loads). However, I'll try a restart later and see how it goes.

I just tested F5 TTS through the WebUI and that works flawlessly. I also tested the Audio/Music Generation section with Medium and Audiogen models, they work without issues too.

Aamir3d commented 1 day ago

So I turned off the GPU, but still no good. The application crashed. image

However, when going through the startup log, I noticed this (before the Gradio WebUI started). Not sure if that's got anything to do with the crash. image

rsxdalv commented 1 day ago

Thanks for the debug info. It really seems like a black box error. It's possible that it happens with fairseq, but the root cause might be another component, such as xformers.

As for the database error, it shouldn't cause the problem.

I'm wondering if RVC works or not. If not, it could really be a fairseq issue. Then we can try reinstalling fairseq or building from source.