Open schxnhxlz opened 2 months ago
Thank you very much for using my project!
At inference time, we use the ground truth torso image and generate only the face. So if you render with the same camera position and eye vector as in the training set and swap in your own audio, the result should connect well with the area outside the previously cropped bbox image. You could also stitch in only a specific masked region if you prefer.
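For the stitching part, here is a rough sketch of how a masked region could be pasted back onto a raw frame with NumPy. It is not code from this project: `stitch_region`, its arguments, and the `(top, left)` bbox convention are all hypothetical names chosen for illustration, and it assumes you already have a mask for the region you want (for example from a face landmark or face parsing tool of your choice).

```python
import numpy as np


def stitch_region(original, generated, mask, bbox):
    """Blend the masked part of a generated crop into the original frame.

    original:  (H, W, 3) uint8 frame from the raw video
    generated: (h, w, 3) uint8 rendered crop
    mask:      (h, w) float array in [0, 1]; 1 means "use the generated pixel"
    bbox:      (top, left) position of the crop in the original frame
               (hypothetical convention, adapt to your own crop metadata)
    """
    top, left = bbox
    h, w = mask.shape

    # Work in float so that soft (feathered) mask edges blend smoothly.
    out = original.astype(np.float32)
    region = out[top:top + h, left:left + w]  # view into `out`

    alpha = mask[..., None].astype(np.float32)  # (h, w, 1), broadcasts over RGB
    region[:] = alpha * generated.astype(np.float32) + (1.0 - alpha) * region

    return np.clip(np.rint(out), 0, 255).astype(np.uint8)


if __name__ == "__main__":
    frame = np.zeros((4, 4, 3), dtype=np.uint8)       # stand-in raw frame
    crop = np.full((2, 2, 3), 200, dtype=np.uint8)    # stand-in generated crop
    lower_face = np.array([[1.0, 0.0], [0.5, 0.0]])   # stand-in mask
    print(stitch_region(frame, crop, lower_face, (1, 1))[:, :, 0])
```

Blurring the mask edges a little before blending usually hides the seam better than a hard binary mask.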
Hey there,
First of all, great project! I would love to try it out. Does it work seamlessly to put the generated video back on top of the raw video?
Like this:
To keep the real eye and forehead movement, I want to mask out just the nose, mouth, and jaw:
Would be awesome if someone could answer this before I train on my own video :) Thanks in advance!