erew123 / alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
GNU Affero General Public License v3.0
1.11k stars 114 forks source link

DeepSpeed did not install on standard windows standalone install. #406

Closed tre4b closed 2 hours ago

tre4b commented 2 hours ago

🔴 Please generate a diagnostics report and upload the "diagnostics.log" as this helps me understand your configuration.

diagnostics.log

Describe the bug Alltalk is working but seemed slow and I noticed DeepSpeed could not be enabled. I noted when All talk started I got the following message, one of which is DeepSpeed version not available. I went into the conda environment and found there was no deepspeed.

I tried to install this but it would not install as it said pytorch was not available, sure enough this was not there in conda list either (nor were either in pip list).

I opened your batch file and went through section by section doing the installs manually. DeepSpeed installed successfully and was visible in Conda list, however on start of Alltalk I still get the screen below.

[AllTalk TTS] ____ _ [AllTalk TTS] / \ | | | | _| | | | | / | [AllTalk TTS] / \ | | | | |/ ` | | |/ / | | | | \ \ [AllTalk TTS] / | | | | | (| | | < | | | | ) | [AllTalk TTS] // __|| ||_,|||_\ || || |/ [AllTalk TTS] [AllTalk TTS] Config file update: No Updates required [AllTalk TTS] Start-up Mode : Standalone mode [AllTalk TTS] WAV file deletion : Disabled [AllTalk TTS] Github updated : 15th November 2024 at 00:40 Branch: alltalkbeta [AllTalk ENG] Transcoding : ffmpeg found [AllTalk ENG] DeepSpeed version : Not available [AllTalk ENG] Python Version : 3.11.10 [AllTalk ENG] PyTorch Version : 2.2.1 [AllTalk ENG] CUDA Version : 12.1 [AllTalk ENG] [AllTalk ENG] Model/Engine : f5tts - f5tts_v1 loading into cuda [AllTalk ENG] Model/Engine : F5-TTS Ready [AllTalk ENG] Load time : 4.96 seconds. [AllTalk TTS] [AllTalk TTS] API Address : 127.0.0.1:7851 [AllTalk TTS] Gradio Light: http://127.0.0.1:7852 [AllTalk TTS] Gradio Dark : http://127.0.0.1:7852?__theme=dark [AllTalk TTS] [AllTalk TTS] AllTalk WIKI: https://github.com/erew123/alltalk_tts/wiki [AllTalk TTS] Errors Help : https://github.com/erew123/alltalk_tts/wiki/Error-Messages-List

What other steps need to be taken to ensure that DeepSpeed can be recognised by the system?

erew123 commented 2 hours ago

Hi @tre4b

Please see the engine information for F5-TTS. It is not a DeepSpeed capable TTS engine:

image

Thanks