Open Kanraaaaa opened 2 years ago
Hi Thanks for your questions! 1/ Yes the strenght_embedding is predicted by learned ranking function. 0- most weak; 1-most strong; 2/ Thanks a lot!! I have corrected the uppder bound to lower bound in the published version; 3/ We are using ESD which is a parallel emotional speech database. We use the reference speech with the same linguistic content for evaluation. We calculate the duration of voiced speech.
@KunZhou9646
Hello, sorry for the interruption but I want to know where I can find the implementation of TextMelIDLoader and TextMelIDCollate, which were supposed to be located under codes/reader
, but they don't exist.
And thank you for sharing your great work! I am really enjoying it :)
Can I use the implementation in nonparaSeq2seqVC_code directly? or does it require any modification?
Hi Kun, I have read this paper and tried to train this network, but I meet some questions as follows:
Look forward your kind reply! Thank you:)