hayeong0 / DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)
https://hayeong0.github.io/DDDM-VC-demo/
160 stars 18 forks source link

Request for Detailed Implementation of SynthesizerTrn Module in DDDM Encoder #4

Closed kur114 closed 5 months ago

kur114 commented 6 months ago

Hello,

I've been reviewing the SynthesizerTrn Module within vc_dddm_mixup.py and it appears to be incomplete. Could you kindly provide the detailed implementation of the forward function? Your assistance would be greatly appreciated.

Additionally, I am encountering some difficulties in understanding the process described for randomly selecting the speaker style s_r using binary selection to achieve the mixed speech representation. This concept, referenced in the document on:

is crucial for my comprehension and application of the model. Could you please provide further clarification on this process or any example implementations?

Thank you!