hongfz16 / AvatarCLIP

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
https://hongfz16.github.io/projects/AvatarCLIP.html
Other

The correspondence between the codebook and the codebook_embedding #7

Open junfanlin opened 2 years ago

junfanlin commented 2 years ago

Hi,

I enjoyed reading your paper and trying the released demo. For motion generation, the essential piece seems to be the correspondence between the codebook and the codebook_embedding. However, when I checked your code, I found that the CLIP features of the poses decoded from the codebook do not match the corresponding entries of codebook_embedding. From Fig. 8 of the paper, I understand that the CLIP feature of one pose is the sum of the CLIP features of several different views of that pose. Would you mind describing in more detail how the codebook and codebook_embedding are computed? If you could release the code for extracting codebook_embedding, I would be very grateful.
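To make the question concrete, here is a minimal sketch of the check described above, under my own assumptions rather than the repository's actual pipeline: each view's CLIP feature is L2-normalized, the views are summed, the sum is renormalized, and the result is compared with the stored embedding by cosine similarity. The function names (`aggregate_view_features`, `mismatched_entries`), the per-view normalization, and the 0.99 threshold are all hypothetical, and plain Python lists stand in for real CLIP features.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def aggregate_view_features(view_features):
    """Combine the per-view features of one pose into a single unit vector.

    Assumption: each view is normalized before summing, and the sum is
    normalized again. The paper's Fig. 8 only states that views are summed.
    """
    if not view_features:
        raise ValueError("need at least one view")
    dim = len(view_features[0])
    total = [0.0] * dim
    for feat in view_features:
        if len(feat) != dim:
            raise ValueError("all views must have the same dimension")
        unit = l2_normalize(feat)
        total = [t + u for t, u in zip(total, unit)]
    return l2_normalize(total)


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    a_unit, b_unit = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_unit, b_unit))


def mismatched_entries(per_pose_views, codebook_embedding, threshold=0.99):
    """Return indices of poses whose aggregated feature disagrees with the
    stored embedding (cosine similarity below the hypothetical threshold)."""
    bad = []
    for idx, (views, stored) in enumerate(zip(per_pose_views, codebook_embedding)):
        if cosine_similarity(aggregate_view_features(views), stored) < threshold:
            bad.append(idx)
    return bad
```

With real data, `per_pose_views[i]` would hold the CLIP image features of the rendered views of the i-th decoded pose, and `codebook_embedding[i]` the released embedding for that entry. A non-empty result from `mismatched_entries` corresponds to the discrepancy I am seeing.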

Thank you in advance.

Best wishes, Jack