Hi Zhang! First of all, thank you very much for such an exciting paper. I have a question regarding architecture and experiments. How will the model perform if I train GeometryPredictor and FaceReenactor on different datasets?
If the answer is 'poorly', what is needed to make it look better? Something like the "Unified Landmark Converter" from your other paper on FReeNet?
(The specific use case I'm trying to implement is one where a person speaks in a language other than English, and I pass English speech plus the pose/blink of the original video to the model so that the person appears to speak English.)
You can train both nets on other datasets as long as they have audio information. Note that the landmark, pose, and blink information must be obtained in advance (I use the Face++ API: https://www.faceplusplus.com.cn/), and the background should not change significantly.
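For reference, a minimal sketch of that preprocessing step using the Face++ Detect API is below. The endpoint URL, parameter names (return_landmark, return_attributes), and key/secret values are assumptions on my part taken from the public Detect API docs, not from the paper's code, so please verify them before use.

```python
# Minimal sketch: fetch per-frame landmarks, head pose, and eye (blink) status from Face++.
# Endpoint and parameter names are assumed from the public Detect API docs -- verify them.
import requests

DETECT_URL = "https://api-cn.faceplusplus.com/facepp/v3/detect"
API_KEY = "your_api_key"        # placeholder
API_SECRET = "your_api_secret"  # placeholder

def annotate_frame(image_path):
    """Return landmarks, head pose, and eye status for a single video frame."""
    with open(image_path, "rb") as f:
        resp = requests.post(
            DETECT_URL,
            data={
                "api_key": API_KEY,
                "api_secret": API_SECRET,
                "return_landmark": 2,                      # 106-point landmarks
                "return_attributes": "headpose,eyestatus", # pose + blink cues
            },
            files={"image_file": f},
        )
    resp.raise_for_status()
    faces = resp.json().get("faces", [])
    if not faces:
        return None  # no face detected in this frame
    face = faces[0]
    return {
        "landmark": face["landmark"],
        "headpose": face["attributes"]["headpose"],
        "eyestatus": face["attributes"]["eyestatus"],
    }
```

Running something like this over every extracted frame gives you the precomputed landmark/pose/blink annotations; the API is rate-limited, so cache the JSON responses.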
"Unified Landmark Converter" is designed for performing geometric transformations between multiple people using one model that has better practicality.
APB2Face should meet your needs because we model the raw audio signal rather than any specific language. You can construct your own datasets and try APB2Face.
Good luck~