EndlessSora / DeeperForensics-1.0

[CVPR 2020] A Large-Scale Dataset for Real-World Face Forgery Detection

About DF-VAE #8

Open NNNNAI opened 3 years ago

NNNNAI commented 3 years ago

Thank you for your work!

Do you plan to release the code and training scripts for DF-VAE?

NNNNAI commented 3 years ago

By the way, the paper says you use stacked hourglass networks to extract landmarks. However, the original implementation from "Stacked Hourglass Networks for Human Pose Estimation" does not include facial landmark detection. Could you tell me which version of the stacked hourglass network you used in the paper? Thank you very much.

NNNNAI commented 3 years ago

May I ask whether the stacked hourglass network in the structure extraction module of DF-VAE is trained jointly with the rest of the model, that is, whether gradients are back-propagated into the stacked hourglass parameters?

EndlessSora commented 3 years ago

> May I ask whether the stacked hourglass of the structure extraction module in dfvae will be trained together, that is, whether the parameters of the stacked hourglass will undergo gradient back propagation.

The answer is no. The stacked hourglass module is frozen: no gradients are back-propagated into its parameters.
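For readers reproducing this, here is a minimal sketch of one common way to keep a landmark extractor frozen while the rest of a model trains. It assumes PyTorch; `TinyHourglass`, `freeze`, and the small `encoder` are hypothetical stand-ins for illustration, not the DeeperForensics code.

```python
import torch
import torch.nn as nn


class TinyHourglass(nn.Module):
    """Hypothetical stand-in for a pretrained landmark/heatmap extractor."""

    def __init__(self, num_landmarks=68):
        super().__init__()
        self.net = nn.Conv2d(3, num_landmarks, kernel_size=3, padding=1)

    def forward(self, x):
        return self.net(x)


def freeze(module):
    """Put a module in eval mode and stop gradients reaching its parameters."""
    module.eval()
    for p in module.parameters():
        p.requires_grad_(False)
    return module


torch.manual_seed(0)
extractor = freeze(TinyHourglass())                    # frozen structure extractor
encoder = nn.Conv2d(68, 8, kernel_size=3, padding=1)   # trainable part (assumed)

images = torch.randn(2, 3, 16, 16)
with torch.no_grad():                                  # no graph is built for the extractor
    heatmaps = extractor(images)

loss = encoder(heatmaps).mean()
loss.backward()

# Only the trainable encoder receives gradients.
frozen_has_grad = any(p.grad is not None for p in extractor.parameters())
encoder_has_grad = all(p.grad is not None for p in encoder.parameters())
```

Passing only the trainable parameters to the optimizer (for example `encoder.parameters()`) keeps the frozen weights out of the update step as well.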

NNNNAI commented 3 years ago

Thank you for your quick answer! By the way, the appendix mentions a KL loss as one of the training losses of DF-VAE. Does this mean that the structure encoder and the appearance encoder each have their own KL loss? Or do you concatenate the latent representations from the two encoders and then predict the mean and variance from the concatenated representation to compute a single KL loss?
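To make the two options concrete, here is a small pure-Python sketch of the standard closed-form KL term between a diagonal Gaussian and a standard normal. It does not say which option DF-VAE uses; the function name and the numbers are made up for illustration.

```python
import math


def kl_to_standard_normal(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ) for one latent vector."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0 for m, lv in zip(mu, logvar))


# Option A: each encoder predicts its own mean / log-variance, one KL term each.
mu_s, logvar_s = [0.5, -0.2], [0.0, 0.1]   # structure latent (illustrative values)
mu_a, logvar_a = [1.0], [-0.3]             # appearance latent (illustrative values)
kl_separate = kl_to_standard_normal(mu_s, logvar_s) + kl_to_standard_normal(mu_a, logvar_a)

# Option B: a single mean / log-variance over the concatenated latent.
# The same statistics are reused here only to compare the two formulas.
kl_joint = kl_to_standard_normal(mu_s + mu_a, logvar_s + logvar_a)
```

Because the KL term of a diagonal Gaussian is a sum over latent dimensions, the two formulas give the same value when the statistics are the same. The options therefore differ only in how the mean and variance are predicted: by two separate heads, or by one head on the concatenated features.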

NNNNAI commented 3 years ago

Could you explain in more detail how DF-VAE combines the structure information with the appearance information, and where the mean and variance used in the KL loss come from? I reproduced your disentangled module and used 2D-FAN in the structure extraction module to extract the landmark information, but the results have been poor: the model cannot even generate a complete face.