instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
https://instantid.github.io/
Apache License 2.0

Question about how to control expression? #142

Closed cavalier501 closed 9 months ago

cavalier501 commented 9 months ago

Thank you for your work. I have a question about facial expression control in InstantID.

It seems that the face embedding provides only the identity information of the face in the reference image and carries no facial expression details. The landmarks, meanwhile, consist of only five keypoints, which offer limited spatial control.

I wonder: How does InstantID manage to maintain consistency between the facial expressions in the output image and the reference image?

Thanks.

haofanwang commented 9 months ago

Actually, we don't want to constrain the facial expression to match the given image. That is why we use only 5 keypoints. To address your concern, you can add a separate facial-pose ControlNet to control the finer details of the face.
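
To make the point about the 5 keypoints concrete, here is a minimal sketch in plain Python. It is not InstantID code: the landmark names, the coordinates, and the helpers `make_face`, `five_point_condition`, and `lip_gap` are all hypothetical and chosen only for illustration. It assumes the five keypoints are the two eyes, the nose, and the two mouth corners, and shows that two toy faces with different mouth openness can map to the same five-point condition, so that condition alone cannot pin down the expression.

```python
# Hypothetical landmark layout, for illustration only: the names and
# coordinates are made up and do not come from InstantID or a face detector.
FIVE_KPS = ("left_eye", "right_eye", "nose", "mouth_left", "mouth_right")


def make_face(mouth_open: float) -> dict:
    """Return a toy landmark set; `mouth_open` moves only the lip midpoints."""
    return {
        "left_eye": (38.0, 52.0),
        "right_eye": (74.0, 52.0),
        "nose": (56.0, 72.0),
        "mouth_left": (42.0, 92.0),
        "mouth_right": (70.0, 92.0),
        # Extra points that a denser landmark model would also provide.
        "upper_lip_mid": (56.0, 90.0 - 2.0 * mouth_open),
        "lower_lip_mid": (56.0, 94.0 + 8.0 * mouth_open),
    }


def five_point_condition(face: dict) -> list:
    """Keep only the five coarse keypoints, discarding the denser points."""
    return [face[name] for name in FIVE_KPS]


def lip_gap(face: dict) -> float:
    """Vertical distance between the lip midpoints (a crude expression cue)."""
    return face["lower_lip_mid"][1] - face["upper_lip_mid"][1]


neutral = make_face(0.0)
open_mouth = make_face(1.0)

# The sparse condition is identical for both toy faces...
same_condition = five_point_condition(neutral) == five_point_condition(open_mouth)
# ...even though the denser landmarks show a clearly different expression.
gap_neutral = lip_gap(neutral)
gap_open = lip_gap(open_mouth)

print("same 5-point condition:", same_condition)
print("lip gap (neutral, open):", gap_neutral, gap_open)
```

Expressions that move the mouth corners, such as a wide smile, would shift two of the five points slightly, so the sparse condition is not entirely expression-free in this toy setup. A denser facial-pose condition is what would carry the remaining detail.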

cavalier501 commented 9 months ago

Thanks very much for your answer.