daswer123 / hallo-webui

Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
https://fudan-generative-vision.github.io/hallo/
59 stars 20 forks source link

RuntimeError: The size of tensor a (96) must match the size of tensor b (64) at non-singleton dimension 4 #19

Open Song367 opened 2 weeks ago

Song367 commented 2 weeks ago

Traceback (most recent call last): File "F:\workplace\hallo-webui\scripts\inference.py", line 424, in inference_process( File "F:\workplace\hallo-webui\scripts\inference.py", line 364, in inference_process pipeline_output = pipeline( File "F:\workplace\hallo-webui\venv\lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context return func(*args, kwargs) File "F:\workplace\hallo-webui\hallo\animate\face_animate.py", line 401, in call noise_pred = self.denoising_unet( File "F:\workplace\hallo-webui\venv\lib\site-packages\torch\nn\modules\module.py", line 1511, in _wrapped_call_impl return self._call_impl(*args, *kwargs) File "F:\workplace\hallo-webui\venv\lib\site-packages\torch\nn\modules\module.py", line 1520, in _call_impl return forward_call(args, kwargs) File "F:\workplace\hallo-webui\hallo\models\unet_3d.py", line 605, in forward sample = sample + mask_cond_fea RuntimeError: The size of tensor a (96) must match the size of tensor b (64) at non-singleton dimension 4