Open thanhkm opened 1 year ago
Hello, thank you for your great project!
I wonder is there any underlying reason to use z instead of z_p + spk_emb for decoder? The second schema could be post_encoder -> z - spk_emb -> z_p -> z_p + spk_emb -> z' -> wav. Will it make the flow more robust in the inference step?
Best regards
Hello, thank you for your great project!
I wonder is there any underlying reason to use z instead of z_p + spk_emb for decoder? The second schema could be post_encoder -> z - spk_emb -> z_p -> z_p + spk_emb -> z' -> wav. Will it make the flow more robust in the inference step?
Best regards