Open wang1612 opened 2 years ago
It might be due to the content loss in the content embedding. Maybe you could try replacing the SEA with a better self-supervised model.
@auspicious3000 But I did not use SEA and the same problem occurred. Do you know the reason?
Why did the speech content of the converted voice with my own trained model changed? Do you know the reason?