Kedreamix / Linly-Dubbing

智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”
Apache License 2.0
1.39k stars 115 forks source link

可以请求一键包吗?或者能提供抱脸的试玩吗😁 #5

Closed aimarxjg closed 1 week ago

aimarxjg commented 3 weeks ago

谢谢大佬~

jerry0908mo commented 3 weeks ago

+1

还是你会提需求,我搭建这个从9点多肝到了天大亮,webui起来了,但是到了识别语音的步骤就是报错。 你这需求提了,省老事了。

Kedreamix commented 3 weeks ago

我会尝试一下,不过有时候主要是可能用到gpu,我得多测试一下,但是正常一点的应该是哦OK的

Kedreamix commented 3 weeks ago

@jerry0908mo 识别语音是哪一部分有问题呢,如果没有设置hf_token,可以关闭人声分离选项

jerry0908mo commented 3 weeks ago

@jerry0908mo 识别语音是哪一部分有问题呢,如果没有设置hf_token,可以关闭人声分离选项

刚刚更新了最新的代码,重新下载了需要的模型。运行webui后,点击智能语音识别后,出现这样的错误。

2024-08-19 20:45:27.046 | INFO     | tools.step020_asr:transcribe_audio:70 - Transcribing videos\村长台钓加拿大\20240805 英文无字幕 阿里这小子在水城威尼斯发来问候\audio_vocals.wav
2024-08-19 20:45:27.046 | INFO     | tools.step021_asr_whisperx:load_whisper_model:34 - Loading WhisperX model: tiny
config.json: 100%|█████████████████████████████████████████████████████████████████████| 2.25k/2.25k [00:00<?, ?B/s]
vocabulary.txt: 100%|█████████████████████████████████████████████████████████████| 460k/460k [00:01<00:00, 346kB/s]
tokenizer.json: 100%|███████████████████████████████████████████████████████████| 2.20M/2.20M [00:02<00:00, 761kB/s]
model.bin: 100%|███████████████████████████████████████████████████████████████| 75.5M/75.5M [00:27<00:00, 2.74MB/s]
No language specified, language will be first be detected for each audio file (increases inference time).0, 761kB/s]
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\queueing.py", line 536, in process_events
    response = await route_utils.call_process_api(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\route_utils.py", line 288, in call_process_api
    output = await app.get_blocks().process_api(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\blocks.py", line 1931, in process_api
    result = await self.call_function(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\blocks.py", line 1516, in call_function    prediction = await anyio.to_thread.run_sync(  # type: ignore
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\to_thread.py", line 56, in run_sync
    return await get_async_backend().run_sync_in_worker_thread(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\_backends\_asyncio.py", line 2177, in run_sync_in_worker_thread
    return await future
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\_backends\_asyncio.py", line 859, in run    result = context.run(func, *args)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\utils.py", line 826, in wrapper
    response = f(*args, **kwargs)
  File "F:\code_source\Linly-Dubbing\tools\step020_asr.py", line 93, in transcribe_all_audio_under_folder
    transcribe_json = transcribe_audio(asr_method, root, whisper_model_name, 'models/ASR/whisper', device, batch_size, diarization, min_speakers, max_speakers)
  File "F:\code_source\Linly-Dubbing\tools\step020_asr.py", line 75, in transcribe_audio
    transcript = whisperx_transcribe_audio(wav_path, model_name, download_root, device, batch_size, diarization, min_speakers, max_speakers)
  File "F:\code_source\Linly-Dubbing\tools\step021_asr_whisperx.py", line 73, in whisperx_transcribe_audio
    load_whisper_model(model_name, download_root, device)
  File "F:\code_source\Linly-Dubbing\tools\step021_asr_whisperx.py", line 36, in load_whisper_model
    whisper_model = whisperx.load_model(model_name, download_root=download_root, device=device)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\whisperx\asr.py", line 347, in load_model
    vad_model = load_vad_model(torch.device(device), use_auth_token=None, **default_vad_options)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\whisperx\vad.py", line 47, in load_vad_model
    raise RuntimeError(
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.

另外我下载模型后,执行 python webui.py, 会有下面的错误提示

(linly_dubbing) F:\code_source\Linly-Dubbing>python webui.py
D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
  torchaudio.set_audio_backend("soundfile")
failed to import ttsfrd, use WeTextProcessing instead
Running on local URL:  http://127.0.0.1:6006

To create a public link, set `share=True` in `launch()`.
Exception in thread Thread-3 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-5 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-7 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-9 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out

需要说明的是,我把之前处理好的 video 目录下的文件没有删除,直接从开始 智能识别语音开始的。

tangcan1600 commented 3 weeks ago

@jerry0908mo 识别语音是哪一部分有问题呢,如果没有设置hf_token,可以关闭人声分离选项

设置了hf_token也报错,不知道是token权限不对还是网络问题,人声分离在哪里关闭,我翻半天说明和原码,也没看关闭的api,是直接把load_diarize_model()注释掉吗

Kedreamix commented 3 weeks ago

你如果使用webui的话,你可以点击那个关于人声分离的旋钮,可以直接点击取消的

Kedreamix commented 3 weeks ago

@jerry0908mo 识别语音是哪一部分有问题呢,如果没有设置hf_token,可以关闭人声分离选项

刚刚更新了最新的代码,重新下载了需要的模型。运行webui后,点击智能语音识别后,出现这样的错误。

2024-08-19 20:45:27.046 | INFO     | tools.step020_asr:transcribe_audio:70 - Transcribing videos\村长台钓加拿大\20240805 英文无字幕 阿里这小子在水城威尼斯发来问候\audio_vocals.wav
2024-08-19 20:45:27.046 | INFO     | tools.step021_asr_whisperx:load_whisper_model:34 - Loading WhisperX model: tiny
config.json: 100%|█████████████████████████████████████████████████████████████████████| 2.25k/2.25k [00:00<?, ?B/s]
vocabulary.txt: 100%|█████████████████████████████████████████████████████████████| 460k/460k [00:01<00:00, 346kB/s]
tokenizer.json: 100%|███████████████████████████████████████████████████████████| 2.20M/2.20M [00:02<00:00, 761kB/s]
model.bin: 100%|███████████████████████████████████████████████████████████████| 75.5M/75.5M [00:27<00:00, 2.74MB/s]
No language specified, language will be first be detected for each audio file (increases inference time).0, 761kB/s]
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\queueing.py", line 536, in process_events
    response = await route_utils.call_process_api(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\route_utils.py", line 288, in call_process_api
    output = await app.get_blocks().process_api(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\blocks.py", line 1931, in process_api
    result = await self.call_function(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\blocks.py", line 1516, in call_function    prediction = await anyio.to_thread.run_sync(  # type: ignore
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\to_thread.py", line 56, in run_sync
    return await get_async_backend().run_sync_in_worker_thread(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\_backends\_asyncio.py", line 2177, in run_sync_in_worker_thread
    return await future
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\anyio\_backends\_asyncio.py", line 859, in run    result = context.run(func, *args)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\utils.py", line 826, in wrapper
    response = f(*args, **kwargs)
  File "F:\code_source\Linly-Dubbing\tools\step020_asr.py", line 93, in transcribe_all_audio_under_folder
    transcribe_json = transcribe_audio(asr_method, root, whisper_model_name, 'models/ASR/whisper', device, batch_size, diarization, min_speakers, max_speakers)
  File "F:\code_source\Linly-Dubbing\tools\step020_asr.py", line 75, in transcribe_audio
    transcript = whisperx_transcribe_audio(wav_path, model_name, download_root, device, batch_size, diarization, min_speakers, max_speakers)
  File "F:\code_source\Linly-Dubbing\tools\step021_asr_whisperx.py", line 73, in whisperx_transcribe_audio
    load_whisper_model(model_name, download_root, device)
  File "F:\code_source\Linly-Dubbing\tools\step021_asr_whisperx.py", line 36, in load_whisper_model
    whisper_model = whisperx.load_model(model_name, download_root=download_root, device=device)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\whisperx\asr.py", line 347, in load_model
    vad_model = load_vad_model(torch.device(device), use_auth_token=None, **default_vad_options)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\whisperx\vad.py", line 47, in load_vad_model
    raise RuntimeError(
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.

另外我下载模型后,执行 python webui.py, 会有下面的错误提示

(linly_dubbing) F:\code_source\Linly-Dubbing>python webui.py
D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher enabled, this function is no-op. You can remove the function call.
  torchaudio.set_audio_backend("soundfile")
failed to import ttsfrd, use WeTextProcessing instead
Running on local URL:  http://127.0.0.1:6006

To create a public link, set `share=True` in `launch()`.
Exception in thread Thread-3 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-5 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-7 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out
Exception in thread Thread-9 (_do_normal_analytics_request):
Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 69, in map_httpcore_exceptions
    yield
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 233, in handle_request
    resp = self._pool.handle_request(req)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 216, in handle_request
    raise exc from None
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\connection_pool.py", line 196, in handle_request
    response = connection.handle_request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http_proxy.py", line 317, in handle_request
    stream = stream.start_tls(**kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_sync\http11.py", line 383, in start_tls
    return self._stream.start_tls(ssl_context, server_hostname, timeout)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_backends\sync.py", line 152, in start_tls
    with map_exceptions(exc_map):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpcore\_exceptions.py", line 14, in map_exceptions
    raise to_exc(exc) from exc
httpcore.ConnectTimeout: _ssl.c:990: The handshake operation timed out

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 1016, in _bootstrap_inner
    self.run()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\threading.py", line 953, in run
    self._target(*self._args, **self._kwargs)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 70, in _do_normal_analytics_request
    data["ip_address"] = get_local_ip_address()
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\gradio\analytics.py", line 131, in get_local_ip_address
    ip_address = httpx.get(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 198, in get
    return request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_api.py", line 106, in request
    return client.request(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 827, in request
    return self.send(request, auth=auth, follow_redirects=follow_redirects)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 914, in send
    response = self._send_handling_auth(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 942, in _send_handling_auth
    response = self._send_handling_redirects(
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 979, in _send_handling_redirects
    response = self._send_single_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_client.py", line 1015, in _send_single_request
    response = transport.handle_request(request)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 232, in handle_request
    with map_httpcore_exceptions():
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\contextlib.py", line 153, in __exit__
    self.gen.throw(typ, value, traceback)
  File "D:\ProgramData\miniconda3\envs\linly_dubbing\lib\site-packages\httpx\_transports\default.py", line 86, in map_httpcore_exceptions
    raise mapped_exc(message) from exc
httpx.ConnectTimeout: _ssl.c:990: The handshake operation timed out

需要说明的是,我把之前处理好的 video 目录下的文件没有删除,直接从开始 智能识别语音开始的。

我看了下你的问题,第一个问题是,自动下载了whisper的tiny模型,但是可能自动下载的时候有一些小问题,所以出现了bug 第二个不是问题,只是gradio的一些链接方式,应该是OK的能正常使用的

tangcan1600 commented 3 weeks ago

@Kedreamix 你如果使用webui的话,你可以点击那个关于人声分离的旋钮,可以直接点击取消的 旋钮?选项框里的单选按钮吗,取消不了,只能一键clear

Kedreamix commented 3 weeks ago

已经更新了colab版本了,后续会更新抱脸的试玩的

Kedreamix commented 3 weeks ago

@Kedreamix 你如果使用webui的话,你可以点击那个关于人声分离的旋钮,可以直接点击取消的 旋钮?选项框里的单选按钮吗,取消不了,只能一键clear

这里应该是分离说话人那个按钮,去掉即可,不过我也更新了代码,你可以重新试一下

image
xinzhu666 commented 3 weeks ago

我看了下你的问题,第一个问题是,自动下载了whisper的tiny模型,但是可能自动下载的时候有一些小问题,所以出现了bug 第二个不是问题,只是gradio的一些链接方式,应该是OK的能正常使用的

我一直都是这个问题啊......models文件夹全删了,再运行,下载好后也是报“RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.”

Kedreamix commented 3 weeks ago

我在scripts下提供了脚本下载whisper的模型,你可以直接下载,用默认的方式进行智能语音识别,这样应该是没有问题的

Phoebe-dot commented 3 weeks ago

+1

还是你会提需求,我搭建这个从9点多肝到了天大亮,webui起来了,但是到了识别语音的步骤就是报错。 你这需求提了,省老事了。

一样 哭了

xinzhu666 commented 3 weeks ago

我在scripts下提供了脚本下载whisper的模型,你可以直接下载,用默认的方式进行智能语音识别,这样应该是没有问题的

我重新起了一个项目,script/download.sh不管是用modelscope和huggingface下载都是一样的报错

2024-08-22 23:19:10.583 | ERROR    | tools.do_everything:process_video:66 - Error processing video (英文无字幕) 阿里这小子在水城威尼斯发来问候: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.

配置: image

报错信息截图: image

dhlsam commented 3 weeks ago

可以增加本地 docker部署么

Kedreamix commented 3 weeks ago

我在scripts下提供了脚本下载whisper的模型,你可以直接下载,用默认的方式进行智能语音识别,这样应该是没有问题的

我重新起了一个项目,script/download.sh不管是用modelscope和huggingface下载都是一样的报错

2024-08-22 23:19:10.583 | ERROR    | tools.do_everything:process_video:66 - Error processing video (英文无字幕) 阿里这小子在水城威尼斯发来问候: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.

配置: image

报错信息截图: image

你好,你是没下载成功么,还是已经下载成功了,希望可以看看对应的文件是不是下载成功了

Kedreamix commented 3 weeks ago

@dhlsam 好滴,后续会增加docker部署方式

xinzhu666 commented 1 week ago

你好,你是没下载成功么,还是已经下载成功了,希望可以看看对应的文件是不是下载成功了

你好,文件树如下,下面也有sha256值:

/home/Linly-Dubbing/models
├── ASR
│   └── whisper
│       ├── faster-whisper-large-v3
│       │   ├── config.json
│       │   ├── model.bin
│       │   ├── preprocessor_config.json
│       │   ├── README.md
│       │   ├── tokenizer.json
│       │   └── vocabulary.json
│       └── wav2vec2_fairseq_base_ls960_asr_ls960.pth
└── TTS
    └── XTTS-v2
        ├── 1f69422a8a8f344c4fca2f0c6b8d41d2151d6615b7321e48e6bb15ae949b119c
        ├── b29bc227d410d4991e0a8c09b858f77415013eeb9fba9650258e96095557d97a
        ├── c7ea20001c6a0a841c77e252d8409f6a74fb423e79b3206a0771ba5989776187
        ├── config.1.json
        ├── config.json
        ├── configuration.json
        ├── dvae.pth
        ├── f0f6137c19a4eab0cbbe4c99b5babacf68b1746e50da90807708c10e645b943b
        ├── hash.1.md5
        ├── hash.md5
        ├── LICENSE.1.txt
        ├── LICENSE.txt
        ├── mel_stats.pth
        ├── model.pth
        ├── README.1.md
        ├── README.md
        ├── samples
        │   ├── de_sample.1.wav
        │   ├── de_sample.wav
        │   ├── en_sample.1.wav
        │   ├── en_sample.wav
        │   ├── es_sample.1.wav
        │   ├── es_sample.wav
        │   ├── fr_sample.1.wav
        │   ├── fr_sample.wav
        │   ├── ja-sample.1.wav
        │   ├── ja-sample.wav
        │   ├── pt_sample.1.wav
        │   ├── pt_sample.wav
        │   ├── tr_sample.1.wav
        │   ├── tr_sample.wav
        │   ├── zh-cn-sample.1.wav
        │   └── zh-cn-sample.wav
        ├── speakers_xtts.pth
        ├── vocab.1.json
        └── vocab.json
对应文件的sha256值: 文件名 大小 (字节) SHA-256
/path/models/ASR/whisper/faster-whisper-large-v3/model.bin 3087284237 69f74147e3334731bc3a76048724833325d2ec74642fb52620eda87352e3d4f1
/path/models/TTS/XTTS-v2/c7ea20001c6a0a841c77e252d8409f6a74fb423e79b3206a0771ba5989776187 1867929118 c7ea20001c6a0a841c77e252d8409f6a74fb423e79b3206a0771ba5989776187
/path/models/TTS/XTTS-v2/model.pth 1867929118 c7ea20001c6a0a841c77e252d8409f6a74fb423e79b3206a0771ba5989776187
/path/models/ASR/whisper/wav2vec2_fairseq_base_ls960_asr_ls960.pth 377664473 488fd4f16de84438ffc945334278c1b9fb9b7159a806c1080b16111a958c945d
/path/models/TTS/XTTS-v2/b29bc227d410d4991e0a8c09b858f77415013eeb9fba9650258e96095557d97a 210514388 b29bc227d410d4991e0a8c09b858f77415013eeb9fba9650258e96095557d97a
/path/models/TTS/XTTS-v2/dvae.pth 210514388 b29bc227d410d4991e0a8c09b858f77415013eeb9fba9650258e96095557d97a
/path/models/TTS/XTTS-v2/f0f6137c19a4eab0cbbe4c99b5babacf68b1746e50da90807708c10e645b943b 7754818 f0f6137c19a4eab0cbbe4c99b5babacf68b1746e50da90807708c10e645b943b
/path/models/TTS/XTTS-v2/speakers_xtts.pth 7754818 f0f6137c19a4eab0cbbe4c99b5babacf68b1746e50da90807708c10e645b943b
/path/models/ASR/whisper/faster-whisper-large-v3/tokenizer.json 2480617 6d8cbd7cd0d8d5815e478dac67b85a26bbe77c1f5e0c6d76d1ce2abc0e5f21ca
/path/models/ASR/whisper/faster-whisper-large-v3/vocabulary.json 1068114 c69260f2ab26d659b7c398f9a2b2b48ed0df16c3b47d7326782fd9cba71690c1
/path/models/TTS/XTTS-v2/samples/pt_sample.1.wav 445498 52deae2188abb9e257e3cb6c91bbd2bf6cd71b3e4c4a197b6c6b15c55ea6f15a
/path/models/TTS/XTTS-v2/samples/pt_sample.wav 445498 52deae2188abb9e257e3cb6c91bbd2bf6cd71b3e4c4a197b6c6b15c55ea6f15a
/path/models/TTS/XTTS-v2/samples/fr_sample.1.wav 432186 f2d86fada78ae2d062fdda550869f886f32a9072d79e43c8b4a892622c5279c8
/path/models/TTS/XTTS-v2/samples/fr_sample.wav 432186 f2d86fada78ae2d062fdda550869f886f32a9072d79e43c8b4a892622c5279c8
/path/models/TTS/XTTS-v2/samples/zh-cn-sample.1.wav 383034 36b230d0e8e8a17c8e3877962a07c522c339cacac97d5b9950b7a4d9c407ece6
/path/models/TTS/XTTS-v2/samples/zh-cn-sample.wav 383034 36b230d0e8e8a17c8e3877962a07c522c339cacac97d5b9950b7a4d9c407ece6
/path/models/TTS/XTTS-v2/vocab.1.json 361219 928260878a59da8a72a2a5b7687fea29d5106137669d90945430fe17e415304a
/path/models/TTS/XTTS-v2/vocab.json 361219 928260878a59da8a72a2a5b7687fea29d5106137669d90945430fe17e415304a
/path/models/TTS/XTTS-v2/samples/es_sample.1.wav 329786 85fdf2f76df1b468a43d2f2ee7e1b8a38e42ed23bb3d5aa41c9d872f1796f544
/path/models/TTS/XTTS-v2/samples/es_sample.wav 329786 85fdf2f76df1b468a43d2f2ee7e1b8a38e42ed23bb3d5aa41c9d872f1796f544
/path/models/TTS/XTTS-v2/samples/de_sample.1.wav 299066 c5c7190fa0ad2f940f31807b9db57ca4917e8ba7fc3dc6c4868c3727c76934a8
/path/models/TTS/XTTS-v2/samples/de_sample.wav 299066 c5c7190fa0ad2f940f31807b9db57ca4917e8ba7fc3dc6c4868c3727c76934a8
/path/models/TTS/XTTS-v2/samples/en_sample.1.wav 299066 2bb31e7bf2b1e6f98f21be9017f5daee44a40ca91126955485f430b272e1a8b5
/path/models/TTS/XTTS-v2/samples/en_sample.wav 299066 2bb31e7bf2b1e6f98f21be9017f5daee44a40ca91126955485f430b272e1a8b5
/path/models/TTS/XTTS-v2/samples/tr_sample.1.wav 299066 72d030a736fb9ada710109f4c7b2a0d425675607b1911e33b7af7070595273d3
/path/models/TTS/XTTS-v2/samples/tr_sample.wav 299066 72d030a736fb9ada710109f4c7b2a0d425675607b1911e33b7af7070595273d3
/path/models/TTS/XTTS-v2/samples/ja-sample.1.wav 284730 20bbc6c58e5221564e76ee40792bbc1fd48a4cb16bf5ff041644df14c635bbb4
/path/models/TTS/XTTS-v2/samples/ja-sample.wav 284730 20bbc6c58e5221564e76ee40792bbc1fd48a4cb16bf5ff041644df14c635bbb4
/path/models/TTS/XTTS-v2/config.1.json 4368 ef262b1454dd2a77e1461b0b2cd53e19b8a7624cc131b837d36df67356bc75e8
/path/models/TTS/XTTS-v2/config.json 4368 ef262b1454dd2a77e1461b0b2cd53e19b8a7624cc131b837d36df67356bc75e8
/path/models/TTS/XTTS-v2/README.1.md 4263 1cfa85b3293f685b3a6537f8da3d94820fd111270e553589073885dea3facfb7
/path/models/TTS/XTTS-v2/README.md 4263 1cfa85b3293f685b3a6537f8da3d94820fd111270e553589073885dea3facfb7
/path/models/TTS/XTTS-v2/LICENSE.1.txt 4014 190f6d7c19b8984f91b97712b94ce92d2b2e640fc677dacab966e955ece9d043
/path/models/TTS/XTTS-v2/LICENSE.txt 4014 190f6d7c19b8984f91b97712b94ce92d2b2e640fc677dacab966e955ece9d043
/path/models/ASR/whisper/faster-whisper-large-v3/config.json 2394 a9306624f5ec14270a014b647e5c316b6e03a662c369758d1b90697a7b0655b9
/path/models/ASR/whisper/faster-whisper-large-v3/README.md 2052 39e96252229f5a3d0141dc81afb65a36fd205461ac21e5b70f2cd1248ef0082c
/path/models/ASR/whisper/faster-whisper-large-v3/.gitattributes 1519 11ad7efa24975ee4b0c3c3a38ed18737f0658a5f75a0a96787b576a78a023361
/path/models/TTS/XTTS-v2/.gitattributes.1 1519 11ad7efa24975ee4b0c3c3a38ed18737f0658a5f75a0a96787b576a78a023361
/path/models/TTS/XTTS-v2/.gitattributes 1519 11ad7efa24975ee4b0c3c3a38ed18737f0658a5f75a0a96787b576a78a023361
/path/models/TTS/XTTS-v2/.msc 1216 dcac9485923bbc8c1020e92db6193d8ae93dd08d2216b014b131b99a9e333114
/path/models/TTS/XTTS-v2/1f69422a8a8f344c4fca2f0c6b8d41d2151d6615b7321e48e6bb15ae949b119c 1067 1f69422a8a8f344c4fca2f0c6b8d41d2151d6615b7321e48e6bb15ae949b119c
/path/models/TTS/XTTS-v2/mel_stats.pth 1067 1f69422a8a8f344c4fca2f0c6b8d41d2151d6615b7321e48e6bb15ae949b119c
/path/models/ASR/whisper/faster-whisper-large-v3/preprocessor_config.json 340 7ccc62c6f2765af1f3b46c00c9b5894426835a05021c8b9c01eecb6dfb542711
/path/models/TTS/XTTS-v2/.cache/huggingface/download/dvae.pth.metadata 125 f70cd35e46d83fa0a4740e04b3d169520028baae6bacf2c73d8d61323710cf43
/path/models/TTS/XTTS-v2/.cache/huggingface/download/mel_stats.pth.metadata 125 6a7bfc980f557b9b4ccd2ac5ee70a0a2290e1b53fcbea468f8799d5651aecf8a
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/model.bin.metadata 124 f4ad40f07024447043da871ce0191325cd9fb469922a4ad1757b0c00d290418b
/path/models/TTS/XTTS-v2/.cache/huggingface/download/model.pth.metadata 124 49a5cf1b22bf7168af84869b01219ac7d8afe31c3d3f9c8cc595ac9019eea4ca
/path/models/TTS/XTTS-v2/.cache/huggingface/download/speakers_xtts.pth.metadata 124 414ef78bec0d37c72e0b5ed27298288571cad26c84b85d00ec9820bd7012d043
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/config.json.metadata 101 1338bf12314828b2f210ed10e37de9d19ca732e124020784bb0eeed0f6ba46a0
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/.gitattributes.metadata 101 64afd4a2bb1063fee78d3cd12ba4d04c0a97ae246848cd134149a6117fbe3b3e
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/preprocessor_config.json.metadata 101 aacd2447dd192ffbb2bf02715da28037bfcafe0050155125d3a7a4db319b5062
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/README.md.metadata 101 8a0876343323fceb0121ad8595fd8a353f2b9f6b22e401fe3e369c8a37779a6f
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/tokenizer.json.metadata 101 b3af69587533f10d94518d682b5e8493af4fa292292befc3429c1665f498266b
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/vocabulary.json.metadata 101 259b2ae649bacba41cbe607943a60624bfe6355a4342fe7306ae036985c1fadd
/path/models/TTS/XTTS-v2/.cache/huggingface/download/config.json.metadata 101 fedc5c6b83602faec95120bb3bdca71fdedc1fe8085884c25ad759e4f1e4d1e5
/path/models/TTS/XTTS-v2/.cache/huggingface/download/.gitattributes.metadata 101 1dec8c3737659c661ee8953536e27fb2b6c963b36c164d3ffc8399213ff925dc
/path/models/TTS/XTTS-v2/.cache/huggingface/download/hash.md5.metadata 101 47faeeb4b22e3a19a54be0e8790d9db78c0fb490cb41e599b687206ee9e260a3
/path/models/TTS/XTTS-v2/.cache/huggingface/download/README.md.metadata 101 61a0d47c5a4ad33ff3c3ec04ac7f0d669f883f12c7dc871b1bee8777ce0ea7a0
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/de_sample.wav.metadata 101 ca6999632f5bf027ee78766cb5ac896ab8aa129c8e035497a6aa413bb222fd9e
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/en_sample.wav.metadata 101 9337af2ddddb96e2c05cce43a3c20d49d842ac6946b068ca5cc840d93f284014
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/ja-sample.wav.metadata 101 94f891ce652d8a84283319ae0b245dac83f1c1894c72e6b496f6feb6b2b916a7
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/pt_sample.wav.metadata 101 2ba2cf13918fa161d1dc834d2709418983c94fab4386b24eec46a51735552f8f
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/zh-cn-sample.wav.metadata 101 b986737c9f874b96d883d0661b6e664276d1040d9baadad04d7e9d84d0151cec
/path/models/TTS/XTTS-v2/.cache/huggingface/download/vocab.json.metadata 101 61546148e9d21c465e7d6c1581ac4a40c77c7217d04660f3913265e4034b6a75
/path/models/TTS/XTTS-v2/.cache/huggingface/download/LICENSE.txt.metadata 100 070c0c36d3027b20a3cd35abdd15418192270c12b7380ea76ea0f1fc3404fe77
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/es_sample.wav.metadata 100 66ca3a9f742b9e8a532ef8d4f52abdcde7869de92f7ccec8d9a4d7ab533712b7
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/fr_sample.wav.metadata 100 c8f1924aba143bed0e2f33a1b7a7429591f82f2912d6ef7fab30a0b4b51225dc
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/tr_sample.wav.metadata 100 4dcc4698930712dc769eae2b9067792bb95ebcad690e0eb0c4c62afe3f7fec65
/path/models/TTS/XTTS-v2/configuration.json 47 c502b6328c67638b401df8dd05de89e9e8d1cff9cd0ada10dfbdbe13556c20de
/path/models/TTS/XTTS-v2/.mv 36 edcb2024bcbd1dd3586dd5c1d442d6bdf20cdcd4750ac83243d136b97ed1ee3e
/path/models/TTS/XTTS-v2/hash.1.md5 32 ef2e25fc4639bb81c6c5048740c1ee8268606a160548abaa2e2fe03b12278c54
/path/models/TTS/XTTS-v2/hash.md5 32 ef2e25fc4639bb81c6c5048740c1ee8268606a160548abaa2e2fe03b12278c54
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/.gitignore 1 684888c0ebb17f374298b65ee2807526c066094c701bcc7ebbe1c1095f494fc1
/path/models/TTS/XTTS-v2/.cache/huggingface/.gitignore 1 684888c0ebb17f374298b65ee2807526c066094c701bcc7ebbe1c1095f494fc1
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/config.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/.gitattributes.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/model.bin.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/preprocessor_config.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/README.md.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/tokenizer.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/ASR/whisper/faster-whisper-large-v3/.cache/huggingface/download/vocabulary.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/config.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/dvae.pth.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/.gitattributes.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/hash.md5.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/LICENSE.txt.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/mel_stats.pth.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/model.pth.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/README.md.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/de_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/en_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/es_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/fr_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/ja-sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/pt_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/tr_sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/samples/zh-cn-sample.wav.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/speakers_xtts.pth.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
/path/models/TTS/XTTS-v2/.cache/huggingface/download/vocab.json.lock 0 e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855