-
(Vach) wing@DESKTOP-GF5P71Q:/data/Vach$ python app.py
Block Mode: True
Namespace(real_fps=15, mike=False, tts='edgetts', link_name='ErNerf', model_name='obama', base_dir='/data/Vach', block_mode=Tru…
-
I have several consecutive PNG images with a transparent background, and a WAV file. How do I use moviepy to combine them into a webm video with a transparent background?
-
the extracted gt img, parsing img, and torso img seem not look well.
-
Hello! I want to express my appreciation for your excellent work.
I have a question regarding inference speed.
I recently conducted a test using a 14-second-long audio clip (equivalent to 351 fr…
-
作者,您好!我使用康辉主持人的说话视频来测试你们的算法,发现效果很不好,主要原因在于中文音频特征的提取,我换成deepspeech效果还是不太行,请问有什么办法解决这种中文说话视频的数字人训练呢?非常期待您们的回复,谢谢!
-
excuse me,is there any plan to develop the torso model training code?
-
我看代码中注释提到了hubert_cn, 请问hubert_cn是哪个模型,能否提供下HF链接
-
换自己的训练模型时,声音我是用的Hubert, 启动app.py报错,大佬帮看看,感谢!
trainer = Trainer('ngp', opt, model, device=device, workspace=opt.workspace, criterion=criterion, fp16=opt.fp16, metrics=metrics, use_checkpoint=opt.c…
-
Hey,
mir ist aufgefallen, dass beim Recycling von gebrauchten moderierten Brennstoffzellen immer K2-Tritium entsteht, aber man kann es nirgendwo verwenden. Auch gibt es einen Mobilen Fusionsreaktor i…
-
How to specify the audio for reasoning?