SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
https://arxiv.org/abs/2410.06885
MIT License
7.36k stars 885 forks source link

Error while generating / no audio #404

Closed Takinono closed 2 weeks ago

Takinono commented 2 weeks ago

Checks

Environment Details

Windows 11, RTX 3070

Steps to Reproduce

When I upload my audio sample along with my prompt and hit synthesize, nothing happens. I do have weird error messages:

image

but it remains stuck at 0% after "Prefix dict has been built successfully".

Edit, tried in CLI, same problem

image

I tried following these fixes, but it doesn't solve the problem. https://github.com/SWivid/F5-TTS/issues/356

Thanks for your help

✔️ Expected Behavior

No response

❌ Actual Behavior

No response

SWivid commented 2 weeks ago

What is your torch (cuda or cpu version installed?) and cuda version, and how it the gpu usage when inference. It would be appreciate if closely follow the issue template and provide detailed info to help figure out. image

Takinono commented 2 weeks ago

Sorry, I'm using : Windows 11 RTX 3070 conda 24.9.2 torch Version: 2.5.1 CUDA Version: 12.3

When using CLI, CPU usage is between 70 and 80%, GPU 1% When using web interface, CPU between 70 and 80%, GPU 1 to 5%

SWivid commented 2 weeks ago

@Takinono so it seems like a torch-cpu version installed try pip install torch==2.5.1+cu121 torchaudio==2.5.1+cu121 --extra-index-url https://download.pytorch.org/whl/cu121 if you still want torch2.5.1 or just following installation instruction: pip install torch==2.3.0+cu118 torchaudio==2.3.0+cu118 --extra-index-url https://download.pytorch.org/whl/cu118

Takinono commented 2 weeks ago

This solved the problem ! pip install torch==2.5.1+cu121 torchaudio==2.5.1+cu121 --extra-index-url https://download.pytorch.org/whl/cu121

Thanks !

SWivid commented 2 weeks ago

will close as solved, feel free to open if further questions