如果是大头照,找脸部位置就不会有问题,如果是全身照,或者人头位置比较小,就会报维度不匹配错误
` File "infer_audio2vid.py", line 258, in
main()
File "infer_audio2vid.py", line 226, in main
video = pipe(
File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
return func(*args, **kwargs)
File "/home/project/EchoMimic-main/src/pipelines/pipeline_echo_mimic.py", line 507, in call
pred = self.denoising_unet(
File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
return forward_call(*args, **kwargs)
File "/home/project/EchoMimic-main/src/models/unet_3d_echo.py", line 494, in forward
sample = sample + face_musk_fea
RuntimeError: The size of tensor a (64) must match the size of tensor b (56) at non-singleton dimension 4`
如果是大头照,找脸部位置就不会有问题,如果是全身照,或者人头位置比较小,就会报维度不匹配错误 ` File "infer_audio2vid.py", line 258, in
main()
File "infer_audio2vid.py", line 226, in main video = pipe( File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context return func(*args, **kwargs)
File "/home/project/EchoMimic-main/src/pipelines/pipeline_echo_mimic.py", line 507, in call pred = self.denoising_unet(
File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl return self._call_impl(*args, **kwargs)
File "/home/miniconda3/envs/echomimic/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl return forward_call(*args, **kwargs)
File "/home/project/EchoMimic-main/src/models/unet_3d_echo.py", line 494, in forward sample = sample + face_musk_fea RuntimeError: The size of tensor a (64) must match the size of tensor b (56) at non-singleton dimension 4`