tencent-ailab / V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
2.03k stars 250 forks source link

想问下关于训练的策略 #5

Closed junwenxiong closed 1 month ago

junwenxiong commented 1 month ago

作者你好,感觉这个工作也是用了多阶段的训练策略,可否说一下在不同的阶段训练时,哪些attention需要被优化?谢谢!

tiankuan93 commented 1 month ago

作者你好,感觉这个工作也是用了多阶段的训练策略,可否说一下在不同的阶段训练时,哪些attention需要被优化?谢谢!

We are working on a technical report in which we will elaborate on the training details. Please be patient for a few more days.