DinoMan / speech-driven-animation


Clarity related to Sync Discriminator #54

Closed Aithu-Snehith closed 3 years ago

Aithu-Snehith commented 3 years ago

Hi, kudos for the great work. I am trying to apply your model to a custom dataset. Could you please clarify how you train the sync discriminator? As mentioned in the paper, the discriminator is trained with original clips as the in-sync class and mismatched clips as the out-of-sync class.

Is this discriminator trained jointly with the generator, or is it pre-trained?

If the sync discriminator is trained together with the generator and the other discriminators, what label is assigned to a generated video–audio pair when computing the loss that is propagated back?

Are the provided pretrained models trained with the sync discriminator described in your latest paper?

Thanks

DinoMan commented 3 years ago

The models provided correspond to the latest paper.

The synchronization discriminator (as the name suggests) is a discriminator, not a perceptual loss, so it is not pre-trained. The entire model is trained end-to-end. In-sync pairs are treated as real (label 1) by the discriminator. Out-of-sync pairs and pairs containing generated video are treated as fake (label 0) by the discriminator. All these details are in the latest paper.
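To illustrate the labeling scheme described above, here is a minimal sketch of the sync-discriminator and generator loss terms. This is not the repository's actual implementation: the function names and the plain-Python BCE are assumptions for illustration, and a real training loop would use framework tensors and batches.

```python
import math

def bce_loss(prediction, label):
    # Binary cross-entropy for a single scalar prediction in (0, 1).
    eps = 1e-7  # clamp to avoid log(0)
    p = min(max(prediction, eps), 1.0 - eps)
    return -(label * math.log(p) + (1.0 - label) * math.log(1.0 - p))

def sync_discriminator_loss(d_in_sync, d_out_of_sync, d_generated):
    # Discriminator targets, per the answer above:
    #   in-sync real pairs        -> label 1 (real)
    #   out-of-sync real pairs    -> label 0 (fake)
    #   pairs with generated video -> label 0 (fake)
    return (bce_loss(d_in_sync, 1.0)
            + bce_loss(d_out_of_sync, 0.0)
            + bce_loss(d_generated, 0.0)) / 3.0

def generator_sync_loss(d_generated):
    # Adversarial objective for the generator: it wants its
    # (audio, generated-video) pair to be scored as in-sync (label 1).
    return bce_loss(d_generated, 1.0)
```

Because the whole model is trained end-to-end, the generator's loss on the generated pair uses label 1 even though the discriminator's own update labels that same pair 0; this is the standard adversarial setup.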