daswer123 / hallo-webui

Webui for Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
https://fudan-generative-vision.github.io/hallo/

RuntimeError: Image decoding failed #15

Open batuhan3526 opened 3 weeks ago

batuhan3526 commented 3 weeks ago

I did everything shown in the video.

Video link: https://www.youtube.com/watch?v=JX9EP7o4KCg

But I got an error when I tried to use this tool.

Here is my cmd output. After giving an audio file and an image as input and pressing the Generate button, I get the error shown at the end of it:

(venv) (hallo) X:\VideoProduction\Hallo\photo2vid\hallo-webui>start.bat
Running on local URL: http://127.0.0.1:7860

To create a public link, set share=True in launch().
A matching Triton is not available, some optimizations will not be enabled
Traceback (most recent call last):
  File "X:\VideoProduction\Hallo\photo2vid\hallo-webui\venv\lib\site-packages\xformers\__init__.py", line 55, in _is_triton_available
    from xformers.triton.softmax import softmax as triton_softmax # noqa
  File "X:\VideoProduction\Hallo\photo2vid\hallo-webui\venv\lib\site-packages\xformers\triton\softmax.py", line 11, in <module>
    import triton
ModuleNotFoundError: No module named 'triton'
WARNING:py.warnings:X:\VideoProduction\Hallo\photo2vid\hallo-webui\venv\lib\site-packages\onnxruntime\capi\onnxruntime_inference_collection.py:69: UserWarning: Specified provider 'CUDAExecutionProvider' is not in available provider names.Available providers: 'AzureExecutionProvider, CPUExecutionProvider'
  warnings.warn(

Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
find model: ./pretrained_models/face_analysis\models\1k3d68.onnx landmark_3d_68 ['None', 3, 192, 192] 0.0 1.0
Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
find model: ./pretrained_models/face_analysis\models\2d106det.onnx landmark_2d_106 ['None', 3, 192, 192] 0.0 1.0
Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
find model: ./pretrained_models/face_analysis\models\genderage.onnx genderage ['None', 3, 96, 96] 0.0 1.0
Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
find model: ./pretrained_models/face_analysis\models\glintr100.onnx recognition ['None', 3, 112, 112] 127.5 127.5
Applied providers: ['CPUExecutionProvider'], with options: {'CPUExecutionProvider': {}}
find model: ./pretrained_models/face_analysis\models\scrfd_10g_bnkps.onnx detection [1, 3, '?', '?'] 127.5 128.0
set det-size: (640, 640)
WARNING:py.warnings:X:\VideoProduction\Hallo\photo2vid\hallo-webui\venv\lib\site-packages\insightface\utils\transform.py:68: FutureWarning: rcond parameter will change to the default of machine precision times max(M, N) where M and N are the input matrix dimensions. To use the future default and silence this warning we advise to pass rcond=None, to keep using the old, explicitly pass rcond=-1.
  P = np.linalg.lstsq(X_homo, Y)[0].T # Affine matrix. 3 x 4

WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
W0000 00:00:1719650612.660818 8592 face_landmarker_graph.cc:174] Sets FaceBlendshapesGraph acceleration to xnnpack by default.
INFO: Created TensorFlow Lite XNNPACK delegate for CPU.
W0000 00:00:1719650612.682909 15524 inference_feedback_manager.cc:114] Feedback manager requires a model with a single signature inference. Disabling support for feedback tensors.
W0000 00:00:1719650612.698059 11600 inference_feedback_manager.cc:114] Feedback manager requires a model with a single signature inference. Disabling support for feedback tensors.
Traceback (most recent call last):
  File "X:\VideoProduction\Hallo\photo2vid\hallo-webui\scripts\inference.py", line 424, in <module>
    inference_process(
  File "X:\VideoProduction\Hallo\photo2vid\hallo-webui\scripts\inference.py", line 187, in inference_process
    source_image_lip_mask = image_processor.preprocess(
  File "x:\videoproduction\hallo\photo2vid\hallo-webui\hallo\datasets\image_processor.py", line 130, in preprocess
    get_mask(source_image_path, cache_dir, face_region_ratio)
  File "x:\videoproduction\hallo\photo2vid\hallo-webui\hallo\utils\util.py", line 436, in get_mask
    landmarks, height, width = get_landmark(file)
  File "x:\videoproduction\hallo\photo2vid\hallo-webui\hallo\utils\util.py", line 371, in get_landmark
    image = mp.Image.create_from_file(str(file))
RuntimeError: Image decoding failed (can't fopen): C:\Users\Kullanıcı\AppData\Local\Temp\gradio\f2a39ffc3155fe33297a57fec8e9ad5829894864\coni.jpeg
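
The failing path contains non-ASCII characters (the "ı" in "Kullanıcı"), and the error comes from a native file open ("can't fopen"). Whether that is the actual cause is not confirmed here; it is only a guess that can be tested. A minimal sketch, assuming the image can be copied to an ASCII-only folder before it is handed to mp.Image.create_from_file. The helper name ascii_safe_copy and the folder passed to it are hypothetical and are not part of hallo-webui:

```python
import os
import shutil
import tempfile
from pathlib import Path


def ascii_safe_copy(src: str, ascii_root: str) -> str:
    """Return a path to the same file that contains only ASCII characters.

    Hypothetical helper (not part of hallo-webui). If `src` is already
    ASCII-only it is returned unchanged; otherwise the file is copied into
    `ascii_root` under a generated ASCII-only name.
    """
    if src.isascii():
        return src
    if not ascii_root.isascii():
        raise ValueError("ascii_root must itself be an ASCII-only path")
    os.makedirs(ascii_root, exist_ok=True)
    # Keep the extension (e.g. ".jpeg") only when it is ASCII itself.
    suffix = Path(src).suffix
    if not suffix.isascii():
        suffix = ""
    # mkstemp generates an ASCII file name inside ascii_root.
    fd, dst = tempfile.mkstemp(suffix=suffix, dir=ascii_root)
    os.close(fd)
    shutil.copyfile(src, dst)
    return dst
```

If the copy loads where the original path did not, the non-ASCII user folder is the likely culprit. In that case, pointing the Gradio temp folder or the Windows TEMP folder at an ASCII-only location may be a simpler workaround than changing code.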

My specs: Python 3.10.14, Windows 11 Pro, onnxruntime=1.18.1

Also, I would prefer to run this on the GPU rather than the CPU. Could you help me? Thanks for your time.
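
On the GPU question: the log above reports that 'CUDAExecutionProvider' is not among the available providers, which usually means the installed ONNX Runtime build has no CUDA support (the GPU build is distributed as the separate onnxruntime-gpu package). A small diagnostic sketch, assuming it is run inside the same venv; choose_providers is a hypothetical helper, while onnxruntime.get_available_providers() is the library's own call:

```python
from typing import List


def choose_providers(available: List[str]) -> List[str]:
    """Prefer CUDA when present, otherwise fall back to CPU.

    Hypothetical helper for illustration; it only filters the list that
    ONNX Runtime reports and does not change any installation.
    """
    preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
    return [p for p in preferred if p in available]


if __name__ == "__main__":
    try:
        import onnxruntime as ort
    except ImportError:
        print("onnxruntime is not installed in this environment")
    else:
        available = ort.get_available_providers()
        print("available providers:", available)
        print("would use:", choose_providers(available))
```

If CUDAExecutionProvider is missing from the printed list, the face-analysis models will keep running on the CPU regardless of the GPU in the machine.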