WelkinYang / Learn2Sing2.0

Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
https://welkinyang.github.io/Learn2Sing2.0/
176 stars, 26 forks

About CLUB mutual information #7

Open LindgeW opened 6 months ago

LindgeW commented 6 months ago

Hi, thanks for your good work. I have two questions:

  1. Is the tanh activation on the logvar output required? Could it be removed, or replaced with a different activation?

  2. During training, I ran into a problem: the logvar prediction network, whose last layer is tanh, always outputs -1 regardless of the input. The overall CLUB MI estimation network then seems to stop working (its parameters fail to update) and gives a negative MI estimate.
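
To make the symptom in question 2 concrete, here is a minimal sketch in plain Python. It is not the repository's code: the function names are hypothetical, and it assumes a 1-D Gaussian q(y|x) with a tanh-bounded log-variance and the sampled CLUB estimate (mean log q over matched pairs minus mean log q over all pairs). It shows two things that may be relevant: once tanh saturates near -1 its gradient is essentially zero, and the sampled estimate can turn negative when the predicted mean fits the data poorly.

```python
import math

def tanh_logvar(pre_activation):
    """Bound the log-variance to [-1, 1] by passing the raw output through tanh."""
    return math.tanh(pre_activation)

def tanh_grad(pre_activation):
    """Derivative of tanh, 1 - tanh(h)^2; it approaches 0 as |h| grows."""
    return 1.0 - math.tanh(pre_activation) ** 2

def gaussian_log_likelihood(y, mu, logvar):
    """log q(y|x) for a 1-D Gaussian, dropping the constant -0.5 * log(2 * pi)."""
    return -0.5 * ((y - mu) ** 2 / math.exp(logvar) + logvar)

def club_estimate(xs, ys, mu_fn, logvar_fn):
    """Sampled CLUB estimate over n pairs (assumed form, see lead-in)."""
    n = len(xs)
    # Matched pairs (x_i, y_i), drawn from the joint distribution.
    positive = sum(
        gaussian_log_likelihood(ys[i], mu_fn(xs[i]), logvar_fn(xs[i]))
        for i in range(n)
    ) / n
    # All pairs (x_i, y_j), approximating the product of marginals.
    negative = sum(
        gaussian_log_likelihood(ys[j], mu_fn(xs[i]), logvar_fn(xs[i]))
        for i in range(n)
        for j in range(n)
    ) / (n * n)
    return positive - negative

# Toy data where y equals x, so x and y are strongly dependent.
xs = [-1.0, 0.0, 1.0]
ys = [-1.0, 0.0, 1.0]

# A strongly negative pre-activation (hypothetical value) saturates tanh.
def saturated_logvar(x):
    return tanh_logvar(-20.0)

saturated_gradient = tanh_grad(-20.0)  # effectively zero

# Same saturated logvar, two different (hypothetical) mean predictors.
estimate_good_mean = club_estimate(xs, ys, lambda x: x, saturated_logvar)
estimate_bad_mean = club_estimate(xs, ys, lambda x: -x, saturated_logvar)
```

In this toy setup the estimate is positive when the mean predictor matches the data and negative when it does not, even though the logvar output is stuck at the same value in both cases. That suggests (without proving anything about the actual training run) that a negative estimate points to a poorly fitted q(y|x), and that a saturated tanh would pass almost no gradient back to the logvar branch.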