erew123 / alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and WAV file maintenance. It can also be used with third-party software via JSON calls.
GNU Affero General Public License v3.0

Not working on Windows 11 (installing beta version for the first time) #347

Closed by nitinmukesh 2 months ago

nitinmukesh commented 2 months ago

Describe the bug: The standalone install completes, but the server does not start.

To Reproduce: Clone the beta version and complete the setup using atsetup.bat.

Screenshots: N/A

Text/logs

Microsoft Windows [Version 10.0.22631.4169]
(c) Microsoft Corporation. All rights reserved.

C:\tut\alltalk_tts>start_alltalk.bat
[AllTalk TTS]     _    _ _ _____     _ _       _____ _____ ____
[AllTalk TTS]    / \  | | |_   _|_ _| | | __  |_   _|_   _/ ___|
[AllTalk TTS]   / _ \ | | | | |/ _` | | |/ /    | |   | | \___ \
[AllTalk TTS]  / ___ \| | | | | (_| | |   <     | |   | |  ___) |
[AllTalk TTS] /_/   \_\_|_| |_|\__,_|_|_|\_\    |_|   |_| |____/
[AllTalk TTS]
[AllTalk TTS] Config file update: No Updates required
[AllTalk TTS] Start-up Mode     : Standalone mode
Traceback (most recent call last):
  File "C:\tut\alltalk_tts\system\config\firstrun.py", line 8, in <module>
    from inputimeout import inputimeout, TimeoutOccurred
ModuleNotFoundError: No module named 'inputimeout'
Error occurred while running the script: Command '['python', 'C:\\tut\\alltalk_tts\\system\\config\\firstrun.py']' returned non-zero exit status 1.
[AllTalk TTS] WAV file deletion : Disabled
[AllTalk TTS] Github updated    : 15th August 2024 at 08:27
Traceback (most recent call last):
  File "C:\tut\alltalk_tts\tts_server.py", line 147, in <module>
    from ffmpeg.asyncio import FFmpeg
ModuleNotFoundError: No module named 'ffmpeg'
[AllTalk TTS] Warning TTS Engine has NOT started up yet. Will keep trying for 240 seconds maximum. Please wait.
[AllTalk TTS] Warning Mechanical hard drives and a slow PCI BUS are examples of things that can affect load times.
[AllTalk TTS] Warning Some TTS engines index their AI TTS models on loading, which can be slow on CPU or old systems.
[AllTalk TTS] Warning Using one of the other TTS engines on slower systems can help ease this issue.
[AllTalk TTS]
[AllTalk TTS] Startup timed out. Full help available here https://github.com/erew123/alltalk_tts#-help-with-problems
[AllTalk TTS] On older systems, you may wish to open and edit script.py with a text editor and change the
[AllTalk TTS] startup_wait_time = 240 setting to something like startup_wait_time = 460 as this will allow
[AllTalk TTS] AllTalk more time (6 mins) to try load the model into your VRAM. Otherwise, please visit the GitHub for
[AllTalk TTS] a list of other possible troubleshooting options.

Desktop (please complete the following information):
AllTalk was updated: using the beta version; git pull also completed
Custom Python environment: No
Text-generation-webUI was updated: N/A (standalone install)

Additional context: Log file diagnostics.log

nitinmukesh commented 2 months ago

Additional steps completed after the above

This is required because the conda install does not install torch with CUDA on Windows:

pip install torch==2.2.1+cu121 torchvision==0.17.1+cu121 torchaudio==2.2.1+cu121 --index-url https://download.pytorch.org/whl/cu121
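Whether the installed torch build actually has CUDA support can be checked with a short script like the one below. This is a minimal sketch and not part of AllTalk; the helper name `torch_cuda_report` is made up for illustration. It relies only on `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`.

```python
import importlib.util


def torch_cuda_report():
    """Summarize whether torch is installed and whether it can use CUDA.

    Hypothetical helper for illustration; not part of AllTalk.
    """
    if importlib.util.find_spec("torch") is None:
        return {"installed": False}

    import torch  # imported lazily so the check also works without torch

    return {
        "installed": True,
        "version": torch.__version__,      # a "+cu121" style suffix marks a CUDA wheel
        "cuda_build": torch.version.cuda,  # None for CPU-only builds
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(torch_cuda_report())
```

On a CPU-only build, `cuda_build` is `None` and `cuda_available` is `False`, which would be consistent with the situation described above. Run it inside the activated AllTalk environment so that it inspects the same Python installation the server uses.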

(C:\tut\alltalk_tts\alltalk_environment\env) C:\tut\alltalk_tts>pip install inputimeout
Looking in indexes: https://pypi.org/simple/, https://pypi.ngc.nvidia.com
Collecting inputimeout
  Downloading inputimeout-1.0.4-py3-none-any.whl.metadata (2.2 kB)
Downloading inputimeout-1.0.4-py3-none-any.whl (4.6 kB)
Installing collected packages: inputimeout
Successfully installed inputimeout-1.0.4
(C:\tut\alltalk_tts\alltalk_environment\env) C:\tut\alltalk_tts>pip install python-ffmpeg
Looking in indexes: https://pypi.org/simple/, https://pypi.ngc.nvidia.com
Collecting python-ffmpeg
  Downloading python_ffmpeg-2.0.12-py3-none-any.whl.metadata (3.2 kB)
Collecting pyee (from python-ffmpeg)
  Downloading pyee-12.0.0-py3-none-any.whl.metadata (2.8 kB)
Requirement already satisfied: typing-extensions in c:\tut\alltalk_tts\alltalk_environment\env\lib\site-packages (from python-ffmpeg) (4.11.0)
Downloading python_ffmpeg-2.0.12-py3-none-any.whl (14 kB)
Downloading pyee-12.0.0-py3-none-any.whl (14 kB)
Installing collected packages: pyee, python-ffmpeg
Successfully installed pyee-12.0.0 python-ffmpeg-2.0.12
(C:\tut\alltalk_tts\alltalk_environment\env) C:\tut\alltalk_tts>pip install sounddevice
Looking in indexes: https://pypi.org/simple/, https://pypi.ngc.nvidia.com
Collecting sounddevice
  Downloading sounddevice-0.5.0-py3-none-win_amd64.whl.metadata (1.4 kB)
Requirement already satisfied: CFFI>=1.0 in c:\users\nitin\appdata\roaming\python\python311\site-packages (from sounddevice) (1.16.0)
Requirement already satisfied: pycparser in c:\users\nitin\appdata\roaming\python\python311\site-packages (from CFFI>=1.0->sounddevice) (2.21)
Downloading sounddevice-0.5.0-py3-none-win_amd64.whl (189 kB)
Installing collected packages: sounddevice
Successfully installed sounddevice-0.5.0
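The missing modules above were found one at a time, by starting the server and reading each traceback. A rough way to check for all of them in one pass is sketched below. It is not part of AllTalk: the `REQUIRED` mapping and the `find_missing` helper are illustrative names, and the list covers only the three packages installed in this thread, not AllTalk's full requirements.

```python
import importlib.util

# Import name -> pip package name, taken from the errors and installs in this thread.
# Illustrative only; this is not AllTalk's full dependency list.
REQUIRED = {
    "inputimeout": "inputimeout",
    "ffmpeg": "python-ffmpeg",
    "sounddevice": "sounddevice",
}


def find_missing(modules):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED)
    if missing:
        packages = " ".join(REQUIRED[name] for name in missing)
        print(f"Missing modules: {missing}")
        print(f"Try: pip install {packages}")
    else:
        print("All listed modules are importable.")
```

Note that more than one PyPI package provides a top-level module named `ffmpeg`, so finding it does not guarantee that `ffmpeg.asyncio` (the import that failed in the traceback) is available. The check only narrows down what is absent.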

Now the server starts:

Microsoft Windows [Version 10.0.22631.4169]
(c) Microsoft Corporation. All rights reserved.

(C:\tut\alltalk_tts\alltalk_environment\env) C:\tut\alltalk_tts>start_alltalk.bat
[AllTalk TTS]     _    _ _ _____     _ _       _____ _____ ____
[AllTalk TTS]    / \  | | |_   _|_ _| | | __  |_   _|_   _/ ___|
[AllTalk TTS]   / _ \ | | | | |/ _` | | |/ /    | |   | | \___ \
[AllTalk TTS]  / ___ \| | | | | (_| | |   <     | |   | |  ___) |
[AllTalk TTS] /_/   \_\_|_| |_|\__,_|_|_|\_\    |_|   |_| |____/
[AllTalk TTS]
[AllTalk TTS] Config file update: No Updates required
[AllTalk TTS] Start-up Mode     : Standalone mode
[AllTalk TTS] WAV file deletion : Disabled
[AllTalk TTS] Github updated    : 15th August 2024 at 08:27
[AllTalk ENG] Transcoding       : ffmpeg found
[AllTalk ENG] DeepSpeed version : 0.14.0+ce78a63
[AllTalk ENG] Python Version    : 3.11.0
[AllTalk ENG] PyTorch Version   : 2.2.1+cu121
[AllTalk ENG] CUDA Version      : 12.1
[AllTalk ENG]
[AllTalk ENG] Model/Engine : xttsv2_2.0.3 loading into cuda
[AllTalk TTS] Warning TTS Engine has NOT started up yet. Will keep trying for 240 seconds maximum. Please wait.
[AllTalk TTS] Warning Mechanical hard drives and a slow PCI BUS are examples of things that can affect load times.
[AllTalk TTS] Warning Some TTS engines index their AI TTS models on loading, which can be slow on CPU or old systems.
[AllTalk TTS] Warning Using one of the other TTS engines on slower systems can help ease this issue.
[AllTalk ENG] Model License: https://coqui.ai/cpml.txt
[AllTalk ENG] Load time : 32.06 seconds.
[AllTalk TTS]
[AllTalk TTS] API Address : 127.0.0.1:7851
[AllTalk TTS] Gradio Light: http://127.0.0.1:7852
[AllTalk TTS] Gradio Dark : http://127.0.0.1:7852?__theme=dark
[AllTalk TTS]
themes/theme_schema@0.0.1.json: 100%|████████████████████████████████████████████| 13.1k/13.1k [00:00<00:00, 13.1MB/s]
[AllTalk TTS] Please use Ctrl+C when exiting AllTalk otherwise a
[AllTalk TTS] subprocess may continue running in the background.
[AllTalk TTS]
[AllTalk TTS] AllTalk Server Ready

nitinmukesh commented 2 months ago

Installation was successful: the server started and launched the UI.
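For scripted setups, one way to confirm that the server has come up is to poll the API address printed in the log (127.0.0.1:7851) until it accepts a TCP connection. The sketch below is an assumption-laden illustration, not AllTalk's own startup check: `wait_for_port` is a hypothetical helper, and its default timeout of 240 seconds simply mirrors the wait mentioned in the log output.

```python
import socket
import time


def wait_for_port(host, port, timeout=240.0, interval=1.0):
    """Poll until a TCP connection to host:port succeeds or the timeout elapses.

    Hypothetical helper for illustration; returns True on success, False on timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # create_connection raises OSError while nothing is listening yet
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


# Example (address taken from the log above):
# wait_for_port("127.0.0.1", 7851)
```

An open port only shows that something is listening; it does not prove that the TTS model has finished loading, so treat a `True` result as a necessary sign of readiness rather than a sufficient one.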