Request for Detailed Implementation of SynthesizerTrn Module in DDDM Encoder

Hello,

I've been reviewing the SynthesizerTrn Module within vc_dddm_mixup.py and it appears to be incomplete. Could you kindly provide the detailed implementation of the forward function? Your assistance would be greatly appreciated.

Additionally, I am encountering some difficulties in understanding the process described for randomly selecting the speaker style s_r using binary selection to achieve the mixed speech representation. This concept, referenced in the document on:

Page 2, Paragraph 3, Line 4
Section 4.3 on Page 6, Paragraph 2, Line 1
Page 7 under the Training section, Line 5

is crucial for my comprehension and application of the model. Could you please provide further clarification on this process or any example implementations?

Thank you!

hayeong0 / DDDM-VC

Request for Detailed Implementation of SynthesizerTrn Module in DDDM Encoder #4