zhangzjn / APB2Face

Official pytorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20
MIT License
63 stars 19 forks source link

Question on generalization #4

Closed leg0m4n closed 4 years ago

leg0m4n commented 4 years ago

Hi Zhang! First of all, thank you very much for such an exciting paper. I have a question regarding architecture and experiments. How will the model perform if I train GeometryPredictor and FaceReenactor on different datasets?

If the answer is 'poorly' what is needed to make it look better? something like "Unified Landmark Converter" from your other paper on FReeNet?

(The specific use case I m trying to implement is when a person speaks in one language(not English), and I am passing an English speech and pose/blink of the original video to the model to make a person speak in English.)

zhangzjn commented 4 years ago