Open skywalker00001 opened 3 months ago
hi, i also encountered similar issues like yours. Did you find any solution to mitigate these artifacts or improve lip synchronization?
hi, i also encountered similar issues like yours. Did you find any solution to mitigate these artifacts or improve lip synchronization?
It seems that the only thing that can be done is to clean the dataset. As long as a high-quality dataset is used, the aforementioned problems will not occur.
https://github.com/user-attachments/assets/8e33dcd4-de7a-4ce9-8fbc-ac74c2494fa7
https://github.com/user-attachments/assets/194cf8f6-6766-42ae-aa32-dd1b3389f02d
Hi everyone, I have been working on training a talking face model using the Hallo code, but I've encountered several issues that I need some advice on. We used a dataset comprising 32 hours of VFHQ and 12 hours of HDTF videos, without performing any data cleaning.
Issue Description:
Training Details: Model Architecture: Hallo code for talking face generation Dataset: 32 hours of VFHQ + 12 hours of HDTF videos (uncleaned) Training Parameters: Aligned with the parameters provided in the original code
Request for Advice: Has anyone encountered similar issues with background artifacts and lip sync mismatch in talking face models? Are there any recommended data cleaning steps or techniques to mitigate these artifacts and improve lip synchronization? Any insights or suggestions would be greatly appreciated! Thank you in advance for your help!
大家好,
我最近在使用Hallo代码训练一个谈话脸模型时遇到了一些问题,需要大家的建议。我们使用了包含32小时VFHQ和12小时HDTF视频的数据集,未进行数据清洗工作。
问题描述:
训练详情: 模型架构:用于谈话脸生成的Hallo代码 数据集:32小时VFHQ + 12小时HDTF视频(未清洗) 训练参数:与原始代码提供的参数对齐
请求建议: 有没有人遇到过类似的谈话脸模型中的背景伪影和嘴形不同步问题? 是否有推荐的数据清洗步骤或技术可以减轻这些伪影并提高嘴形同步效果? 任何见解或建议都将不胜感激! 提前感谢大家的帮助!