jianchang512 / pyvideotrans

Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation.
https://pyvideotrans.com
GNU General Public License v3.0
10.74k stars 1.21k forks

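Custom video translation with local ChatTTS dubbing: roughly how long should a 10-minute video take? The progress bar stops moving after 52%, and ChatTTS's app.exe seems to keep running #619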

Open tomyu168 opened 5 hours ago

tomyu168 commented 5 hours ago

At this rate it looks like it will need to run for several dozen hours.
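A rough way to sanity-check that estimate is to read the per-segment inference times that ChatTTS prints in its console log (the `推理时长: N 秒` entries, "inference time: N seconds") and extrapolate. The sketch below is only an illustration: the function names, the `num_segments` value, and the `concurrency` parameter are assumptions made for the example and are not part of pyvideotrans or ChatTTS.

```python
import re

# Matches ChatTTS console entries such as "推理时长: 165.07 秒".
INFERENCE_RE = re.compile(r"推理时长:\s*([0-9]+(?:\.[0-9]+)?)\s*秒")

# Three entries copied from the log in this issue, used as sample input.
SAMPLE_LOG = (
    "code: 7%| 146/2048(max) [00:49, 10.38it/s] 推理时长: 165.07 秒 "
    "code: 9%| 181/2048(max) [01:28, 3.93it/s] 推理时长: 84.6 秒 "
    "code: 1%| 12/2048(max) [00:35, 3.00s/it] 推理时长: 161.55 秒 "
)


def parse_inference_seconds(log_text):
    """Return every logged inference duration, in seconds, as a float."""
    return [float(m) for m in INFERENCE_RE.findall(log_text)]


def estimate_total_hours(durations, num_segments, concurrency=1):
    """Extrapolate total dubbing time from the mean per-segment duration.

    num_segments is the number of subtitle lines to synthesize (hypothetical
    input); concurrency is how many requests are assumed to run in parallel.
    """
    if not durations:
        raise ValueError("no inference durations found in the log")
    mean_seconds = sum(durations) / len(durations)
    return mean_seconds * num_segments / concurrency / 3600


durations = parse_inference_seconds(SAMPLE_LOG)
print(f"segments seen: {len(durations)}")
print(f"estimated hours for 200 segments: {estimate_total_hours(durations, 200):.2f}")
```

The interleaved progress bars in the log suggest several requests are being processed at once, so dividing by a `concurrency` greater than 1 may be closer to wall-clock time. Treat the result as an order-of-magnitude guess.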

tomyu168 commented 5 hours ago

使用nemo处理英文ok prompt=''%|█▋ | 47/2048(max) [00:23, 12.53it/s] te=["It's a big deal [uv_break] "] | 106/2048(max) [01:39, 12.13it/s] WARNING:ChatTTS.core:found invalid characters: {"'"} text: 3%|██▏ | 11/384(max) [00:01, 7.16it/s] code: 10%|███████▌ | 211/2048(max) [02:06, 1.66it/s] code: 7%|█████▎ | 146/2048(max) [00:49, 10.38it/s] 推理时长: 165.07 秒 | 152/2048(max) [00:50, 10.33it/s] code: 7%|█████▍ | 152/2048(max) [00:50, 12.37it/s]voice='6653'██▋ | 74/2048(max) [00:09, 9.97it/s] 当前使用音色:根据seed=6653获取随机音色 | 160/2048(max) [00:51, 12.07it/s] WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 7%|█████▌ | 153/2048(max) [01:03, 12.37it/s] 使用nemo处理英文ok prompt=''%|█████▊ | 161/2048(max) [01:04, 12.07it/s] te=['This misunderstanding was a strict attempt'] code: 4%|███▎ | 89/2048(max) [00:47, 1.89it/s] text: 5%|███▊ | 19/384(max) [00:02, 9.36it/s] code: 9%|██████▌ | 181/2048(max) [01:28, 3.93it/s] 推理时长: 84.6 秒 | 0/2048(max) [00:00, ?it/s] code: 9%|██████▌ | 181/2048(max) [01:28, 8.70it/s]voice='6653'▏ | 6/2048(max) [00:00, 13.55it/s] 当前使用音色:根据seed=6653获取随机音色 | 189/2048(max) [01:29, 8.63it/s] WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
code: 9%|██████▌ | 181/2048(max) [01:46, 8.70it/s] 使用nemo处理英文ok | 7/2048(max) [00:18, 13.55it/s] prompt=''%|██████▊ | 189/2048(max) [01:47, 8.63it/s] te=['Put the Fuad Shooker in a special location'] code: 9%|██████▊ | 187/2048(max) [02:04, 1.51it/s] code: 1%|▍ | 12/2048(max) [00:35, 3.00s/it] 推理时长: 161.55 秒 | 14/2048(max) [00:36, 2.05s/it] text: 1%|▊ | 4/384(max) [00:00, 7.38it/s]voice='6653'▋ | 20/2048(max) [00:36, 1.41it/s] 当前使用音色:根据seed=6653获取随机音色 | 203/2048(max) [02:05, 1.91it/s] WARNING:waitress.queue:Task queue depth is 1 | 9/384(max) [00:01, 9.45it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 使用nemo处理英文ok | 21/2048(max) [00:49, 1.41it/s] prompt=''%|███████▎ | 203/2048(max) [02:18, 1.91it/s] te=["The captain's plan soon succeeded"] | 10/384(max) [00:14, 3.17s/it] WARNING:ChatTTS.core:found invalid characters: {"'"} text: 4%|███▏ | 16/384(max) [00:36, 2.28s/it] text: 4%|██▉ | 15/384(max) [00:01, 9.97it/s] code: 13%|█████████▊ | 273/2048(max) [02:46, 1.64it/s] 推理时长: 240.04 秒 | 91/2048(max) [01:17, 10.87it/s] code: 3%|██▎ | 62/2048(max) [00:05, 12.76it/s]voice='6653'███▌ | 97/2048(max) [01:17, 12.39it/s] 当前使用音色:根据seed=6653获取随机音色 WARNING:waitress.queue:Task queue depth is 1 | 49/2048(max) [00:06, 9.74it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
code: 3%|██▎ | 63/2048(max) [00:22, 3.03s/it] 使用nemo处理英文ok | 99/2048(max) [01:35, 2.71s/it] prompt='' te=['When the captain entered the seventh floor of the captain'] code: 6%|████▎ | 118/2048(max) [01:54, 1.03it/s] 推理时长: 151.94 秒 | 117/2048(max) [01:54, 3.67it/s] text: 5%|████▏ | 21/384(max) [00:02, 9.52it/s] code: 4%|███▎ | 89/2048(max) [00:42, 8.25it/s]voice='6653'▏ | 4/2048(max) [00:00, 14.37it/s] 当前使用音色:根据seed=6653获取随机音色 | 66/2048(max) [00:42, 5.89it/s] WARNING:waitress.queue:Task queue depth is 1 | 70/2048(max) [00:42, 8.27it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 4%|███▎ | 91/2048(max) [00:54, 1.92s/it] 使用nemo处理英文ok | 5/2048(max) [00:18, 14.37it/s] prompt='' te=['The fifth layer of the captain was forced to make a movement'] code: 5%|███▌ | 100/2048(max) [01:18, 1.27it/s] code: 1%|▌ | 15/2048(max) [00:36, 1.08s/it] 推理时长: 114.78 秒 | 9/384(max) [00:00, 10.10it/s] code: 4%|██▊ | 77/2048(max) [01:19, 1.33s/it] code: 1%|▊ | 23/2048(max) [00:37, 3.38it/s]v当前使用音色:根据seed=6653获取随机音色 | 15/384(max) [00:01, 12.49it/s] WARNING:waitress.queue:Task queue depth is 1 | 83/2048(max) [01:19, 3.39it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
使用nemo处理英文ok | 23/2048(max) [00:50, 3.38it/s] prompt=''%|███▎ | 17/384(max) [00:06, 1.21it/s] te=["Captain's crew, Probably F-sixteen"] | 84/2048(max) [01:35, 3.58s/it] WARNING:ChatTTS.core:found invalid characters: {'-', "'"} text: 6%|████▌ | 23/384(max) [00:36, 1.59s/it] text: 5%|███▉ | 20/384(max) [00:02, 8.92it/s] code: 7%|████▊ | 134/2048(max) [02:00, 1.11it/s] code: 4%|███▎ | 90/2048(max) [01:18, 10.73it/s] 推理时长: 192.34 秒 | 92/2048(max) [01:18, 10.74it/s] code: 3%|██ | 56/2048(max) [00:04, 13.66it/s]voice='6653'███▌ | 100/2048(max) [01:18, 13.55it/s] 当前使用音色:根据seed=6653获取随机音色 | 68/2048(max) [00:06, 12.60it/s] WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 3%|██ | 57/2048(max) [00:16, 13.66it/s] 使用nemo处理英文ok | 100/2048(max) [01:30, 13.55it/s] prompt=''%|██▌ | 70/2048(max) [00:10, 1.32it/s] te=['Forced to use shooting guns [uv_break] '] text: 4%|██▊ | 14/384(max) [00:02, 6.63it/s] code: 4%|███▏ | 88/2048(max) [00:42, 2.06it/s] code: 6%|████▋ | 130/2048(max) [01:56, 8.44it/s] 推理时长: 79.26 秒 | 132/2048(max) [01:56, 9.17it/s] code: 0%|▎ | 7/2048(max) [00:00, 8.04it/s]voice='6653'█████ | 140/2048(max) [01:57, 12.74it/s] 当前使用音色:根据seed=6653获取随机音色 | 108/2048(max) [00:44, 12.27it/s] WARNING:waitress.queue:Task queue depth is 1 | 14/2048(max) [00:01, 11.30it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
使用nemo处理英文ok | 140/2048(max) [02:12, 12.74it/s] prompt=''%|███▉ | 109/2048(max) [00:59, 12.27it/s] te=['A gun, a gun [uv_break] '] | 14/2048(max) [00:16, 11.30it/s] text: 3%|██▏ | 11/384(max) [00:01, 8.15it/s] code: 9%|██████▋ | 184/2048(max) [02:36, 1.17it/s] code: 9%|██████▋ | 184/2048(max) [02:36, 10.75it/s] 推理时长: 193.06 秒 | 152/2048(max) [01:23, 10.61it/s] code: 2%|█▍ | 40/2048(max) [00:03, 13.14it/s]voice='6653' 当前使用音色:根据seed=6653获取随机音色 | 160/2048(max) [01:24, 12.66it/s] WARNING:waitress.queue:Task queue depth is 1 | 52/2048(max) [00:41, 9.66it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 2%|█▌ | 41/2048(max) [00:32, 5.17s/it] 使用nemo处理英文ok prompt=''%|█████▊ | 161/2048(max) [01:42, 12.66it/s] te=['Let the audience choose [uv_break] '] text: 3%|█▉ | 10/384(max) [00:01, 8.21it/s] code: 8%|██████▎ | 174/2048(max) [02:00, 1.44it/s] code: 0%| | 0/2048(max) [00:00, ?it/s] 推理时长: 192.94 秒 | 2/2048(max) [00:00, 11.08it/s] code: 3%|██ | 55/2048(max) [00:39, 1.38it/s] code: 3%|██▎ | 64/2048(max) [01:17, 2.40it/s] 推理时长: 76.12 秒 | 4/2048(max) [00:00, 12.46it/s] code: 3%|██▍ | 66/2048(max) [01:17, 1.18s/it] 推理时长: 114.59 秒 | 6/2048(max) [00:00, 14.66it/s] voice='6653'▍ | 12/2048(max) [00:00, 21.82it/s] 当前使用音色:根据seed=6653获取随机音色 voice='6653' 当前使用音色:根据seed=6653获取随机音色 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. voice='6653' INFO:NeMo-text-processing:Creating ClassifyFst grammars. 当前使用音色:根据seed=6653获取随机音色 WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
使用nemo处理英文ok | 14/2048(max) [00:16, 21.82it/s] prompt='' te=['One is bombs in all three rooms'] 使用nemo处理英文ok prompt='' te=['It is possible to be exploded by a continuously controlled bomb'] text: 0%| | 0/384(max) [00:00, ?it/s] 使用nemo处理英文ok prompt=''%| | 0/384(max) [00:00, ?it/s] te=['The second is the promised Mosart'] text: 3%|██▌ | 13/384(max) [00:02, 5.25it/s] text: 4%|██▊ | 14/384(max) [00:02, 6.43it/s] text: 7%|████▉ | 25/384(max) [00:02, 9.37it/s] code: 3%|██▏ | 60/2048(max) [01:50, 1.84s/it] code: 3%|██▏ | 59/2048(max) [01:50, 10.27it/s] 推理时长: 146.2 秒 | 20/2048(max) [00:01, 11.24it/s] code: 2%|█▍ | 38/2048(max) [00:03, 12.87it/s]voice='6653'▋ | 19/2048(max) [00:02, 8.11it/s] 当前使用音色:根据seed=6653获取随机音色 | 28/2048(max) [00:02, 13.51it/s] WARNING:waitress.queue:Task queue depth is 1 | 23/2048(max) [00:02, 9.68it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 2%|█▍ | 39/2048(max) [00:17, 12.87it/s] 使用nemo处理英文ok prompt=''%|█ | 29/2048(max) [00:31, 5.17s/it] te=["Wait within two o'clock [uv_break] "] WARNING:ChatTTS.core:found invalid characters: {"'"} text: 4%|██▉ | 15/384(max) [00:01, 9.20it/s] code: 5%|███▊ | 107/2048(max) [00:43, 2.45it/s] code: 3%|█▉ | 52/2048(max) [00:04, 11.13it/s] 推理时长: 150.33 秒 | 98/2048(max) [00:42, 11.15it/s] code: 4%|██▋ | 74/2048(max) [00:43, 7.91it/s]voice='6653'██ | 58/2048(max) [00:05, 12.97it/s] 当前使用音色:根据seed=6653获取随机音色 | 106/2048(max) [00:43, 13.37it/s] WARNING:waitress.queue:Task queue depth is 1 | 80/2048(max) [00:43, 9.73it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 
使用nemo处理英文ok | 60/2048(max) [00:23, 2.84s/it] prompt=''%|███▊ | 106/2048(max) [00:58, 13.37it/s] te=["Fire a shot at this part of the client's room"] WARNING:ChatTTS.core:found invalid characters: {"'"} text: 4%|██▉ | 15/384(max) [00:01, 9.23it/s] code: 5%|███▌ | 96/2048(max) [00:43, 2.19it/s] code: 1%|▋ | 20/2048(max) [00:01, 10.96it/s] 推理时长: 79.4 秒 | 143/2048(max) [01:21, 10.08it/s] code: 1%|▉ | 26/2048(max) [00:02, 12.94it/s]voice='6653' 当前使用音色:根据seed=6653获取随机音色 | 151/2048(max) [01:22, 13.00it/s] WARNING:waitress.queue:Task queue depth is 1 | 112/2048(max) [01:22, 9.64it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 1%|▉ | 27/2048(max) [00:20, 12.94it/s] 使用nemo处理英文ok prompt=''%|█████▍ | 151/2048(max) [01:40, 13.00it/s] te=['The third is to shoot at the target with one shot'] | 113/2048(max) [01:34, 2.69s/it] code: 6%|████▎ | 121/2048(max) [01:57, 1.03it/s] 推理时长: 221.68 秒 | 11/384(max) [00:01, 10.44it/s] text: 5%|███▌ | 18/384(max) [00:01, 10.60it/s] code: 2%|█▋ | 47/2048(max) [00:38, 4.49it/s]voice='6653' | 0/2048(max) [00:00, ?it/s] 当前使用音色:根据seed=6653获取随机音色 | 171/2048(max) [01:58, 4.54it/s] WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 2%|█▋ | 47/2048(max) [00:51, 4.49it/s] 使用nemo处理英文ok prompt=''%|██████▏ | 171/2048(max) [02:11, 4.54it/s] te=["These are all in front of the video, so don't miss out"] WARNING:ChatTTS.core:found invalid characters: {"'"} code: 9%|██████▉ | 192/2048(max) [02:35, 1.23it/s] code: 3%|██▌ | 69/2048(max) [01:16, 4.69it/s] 推理时长: 260.42 秒 | 191/2048(max) [02:35, 4.07it/s] code: 4%|██▋ | 75/2048(max) [01:16, 8.43it/s]voice='6653'█ | 29/2048(max) [00:38, 9.42it/s] 当前使用音色:根据seed=6653获取随机音色 WARNING:waitress.queue:Task queue depth is 1 | 19/384(max) [00:02, 8.87it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. 
INFO:NeMo-text-processing:Creating ClassifyFst grammars. text: 5%|███▉ | 20/384(max) [00:06, 3.02it/s] code: 4%|██▊ | 76/2048(max) [01:34, 8.43it/s] 使用nemo处理英文ok | 29/2048(max) [00:56, 9.42it/s] prompt=''%| | 0/2048(max) [00:00, ?it/s] te=["But first, let's examine the environment"] WARNING:ChatTTS.core:found invalid characters: {"'"} text: 4%|███▏ | 16/384(max) [00:01, 10.13it/s] code: 8%|█████▌ | 154/2048(max) [01:58, 1.30it/s] code: 5%|███▉ | 109/2048(max) [01:20, 11.18it/s] 推理时长: 155.11 秒 | 57/2048(max) [00:28, 7.97it/s] code: 3%|██ | 58/2048(max) [00:28, 8.02it/s]voice='6653'████▏ | 117/2048(max) [01:20, 13.36it/s] 当前使用音色:根据seed=6653获取随机音色 | 63/2048(max) [00:28, 10.51it/s] WARNING:waitress.queue:Task queue depth is 1 | 70/2048(max) [00:06, 13.41it/s] NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. 使用nemo处理英文ok | 117/2048(max) [01:38, 13.36it/s] prompt=''%|██▎ | 64/2048(max) [00:46, 10.51it/s] te=['This is a five-block guest room'] | 71/2048(max) [00:24, 13.41it/s] WARNING:ChatTTS.core:found invalid characters: {'-'} text: 3%|██▏ | 11/384(max) [00:01, 10.24it/s] code: 5%|███▊ | 107/2048(max) [00:43, 2.46it/s] 推理时长: 79.53 秒 | 154/2048(max) [01:57, 10.09it/s] code: 1%|█ | 30/2048(max) [00:02, 12.91it/s]voice='6653'█████▊ | 160/2048(max) [01:58, 12.67it/s] 当前使用音色:根据seed=6653获取随机音色 | 96/2048(max) [01:06, 9.71it/s] WARNING:waitress.queue:Task queue depth is 1 NeMo-text-processing :: INFO :: Creating ClassifyFst grammars. INFO:NeMo-text-processing:Creating ClassifyFst grammars. code: 2%|█▏ | 31/2048(max) [00:13, 12.91it/s] code: 8%|█████▊