lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project built around TTS generation models. It implements an API server and a Gradio-based WebUI.
https://huggingface.co/spaces/lenML/ChatTTS-Forge
GNU Affero General Public License v3.0

FileNotFoundError: [WinError 2] 系统找不到指定的文件。 (The system cannot find the file specified.) #26

Closed QiYusheng closed 4 months ago

QiYusheng commented 4 months ago

Running on local URL: http://0.0.0.0:7860
INFO:httpx:HTTP Request: GET http://localhost:7860/startup-events "HTTP/1.1 200 OK"
INFO:httpx:HTTP Request: HEAD http://localhost:7860/ "HTTP/1.1 200 OK"

To create a public link, set share=True in launch().
INFO:modules.models:Loading ChatTTS models
INFO:modules.ChatTTS.ChatTTS.core:Load from local: ./models/ChatTTS
INFO:modules.ChatTTS.ChatTTS.core:vocos loaded.
INFO:modules.ChatTTS.ChatTTS.core:dvae loaded.
INFO:modules.ChatTTS.ChatTTS.core:gpt loaded.
INFO:modules.ChatTTS.ChatTTS.core:decoder loaded.
INFO:modules.ChatTTS.ChatTTS.core:tokenizer loaded.
INFO:modules.ChatTTS.ChatTTS.core:All initialized.
INFO:modules.models:ChatTTS models loaded
INFO:modules.generate_audio:('spk', 'female2')
INFO:modules.generate_audio:{'text': ['chat T T S 是一款强大的对话式文本转语音模型。它有中英混读和多说话人的能力。'], 'infer_seed': 42, 'temperature': 0.3, 'top_P': 0.7, 'top_K': 20, 'prompt1': '', 'prompt2': '', 'prefix': ''}
0%| | 0/2048 [00:00<?, ?it/s]
D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\transformers\models\llama\modeling_llama.py:649: UserWarning: 1Torch was not compiled with flash attention. (Triggered internally at ..\aten\src\ATen\native\transformers\cuda\sdp_utils.cpp:263.)
  attn_output = torch.nn.functional.scaled_dot_product_attention(
100%|█████████████████████████████████████████████████████████████████████████████| 2048/2048 [00:02<00:00, 685.41it/s]
D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\pydub\utils.py:198: RuntimeWarning: Couldn't find ffprobe or avprobe - defaulting to ffprobe, but may not work
  warn("Couldn't find ffprobe or avprobe - defaulting to ffprobe, but may not work", RuntimeWarning)
Traceback (most recent call last):
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\gradio\queueing.py", line 532, in process_events
    response = await route_utils.call_process_api(
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\gradio\route_utils.py", line 276, in call_process_api
    output = await app.get_blocks().process_api(
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\gradio\blocks.py", line 1928, in process_api
    result = await self.call_function(
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\gradio\blocks.py", line 1514, in call_function
    prediction = await anyio.to_thread.run_sync(
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\anyio\to_thread.py", line 56, in run_sync
    return await get_async_backend().run_sync_in_worker_thread(
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 2177, in run_sync_in_worker_thread
    return await future
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 859, in run
    result = context.run(func, *args)
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\gradio\utils.py", line 832, in wrapper
    response = f(*args, **kwargs)
  File "D:\chatTTS\ChatTTS-Forge\modules\webui\webui_utils.py", line 205, in tts_generate
    sample_rate, audio_data = synthesize_audio(
  File "D:\chatTTS\ChatTTS-Forge\modules\synthesize_audio.py", line 63, in synthesize_audio
    audio_segments = synthesizer.synthesize_segments(text_segments)
  File "D:\chatTTS\ChatTTS-Forge\modules\SynthesizeSegments.py", line 219, in synthesize_segments
    self.process_voice_segments(segments, bucket, audio_segments)
  File "D:\chatTTS\ChatTTS-Forge\modules\SynthesizeSegments.py", line 179, in process_voice_segments
    audio_segment = audio_data_to_segment(audio_data, sr)
  File "D:\chatTTS\ChatTTS-Forge\modules\SynthesizeSegments.py", line 25, in audio_data_to_segment
    return AudioSegment.from_file(byte_io, format="wav")
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\pydub\audio_segment.py", line 728, in from_file
    info = mediainfo_json(orig_file, read_ahead_limit=read_ahead_limit)
  File "D:\chatTTS\ChatTTS-Forge\venv\lib\site-packages\pydub\utils.py", line 274, in mediainfo_json
    res = Popen(command, stdin=stdin_parameter, stdout=PIPE, stderr=PIPE)
  File "C:\Users\Admin\AppData\Local\Programs\Python\Python310\lib\subprocess.py", line 969, in __init__
    self._execute_child(args, executable, preexec_fn, close_fds,
  File "C:\Users\Admin\AppData\Local\Programs\Python\Python310\lib\subprocess.py", line 1438, in _execute_child
    hp, ht, pid, tid = _winapi.CreateProcess(executable, args,
FileNotFoundError: [WinError 2] 系统找不到指定的文件。


The version numbers are as follows ↓
🍦 [ChatTTS-Forge] version: [v0.5.5-36-gb34a0f8] | branch: main | python: 3.10.6 | torch: 2.2.2+cu121

zhzLuke96 commented 4 months ago

Check whether ffmpeg is available in your runtime environment.

ffmpeg download page: https://www.ffmpeg.org/download.html#build-windows
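One quick way to run that check is from the same Python environment that launches the WebUI. The snippet below is only a sketch: the helper name `find_missing_tools` is made up for illustration, and it uses nothing beyond the standard library's `shutil.which`, which looks up an executable on `PATH`. It checks `ffprobe` as well as `ffmpeg`, because the traceback fails while pydub tries to start the prober.

```python
import shutil


def find_missing_tools(tools=("ffmpeg", "ffprobe")):
    """Return the names of the given tools that cannot be found on PATH.

    Hypothetical helper for illustration; not part of ChatTTS-Forge or pydub.
    """
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("ffmpeg and ffprobe are both on PATH.")
```

If either name is reported as missing, the Python process cannot see the binaries, even if ffmpeg has been downloaded somewhere on disk.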

QiYusheng commented 4 months ago

Even just for Windows there are many builds. Which one exactly should I download?

zhzLuke96 commented 4 months ago

> Even just for Windows there are many builds. Which one exactly should I download?

See this document: https://github.com/lenML/ChatTTS-Forge/blob/main/docs/dependencies.md#%E5%AE%89%E8%A3%85-ffmpeg