ljh0412 closed this issue 7 months ago
The spectrogram lengths and the ying lengths are the same in this case, since both are computed with the same hop length, etc. Each spec frame has a corresponding ying frame.
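To illustrate why either length gives the same mask, here is a minimal plain-Python sketch. It is not the repo's code: `num_frames`, the `hop_length` value, the sample counts, and the list-based `sequence_mask` are all hypothetical stand-ins, and the framing rule (one frame per full hop) is an assumption.

```python
def num_frames(num_samples, hop_length):
    # Assumed framing rule: one frame per full hop of audio samples.
    return num_samples // hop_length


def sequence_mask(lengths, max_length=None):
    # Hypothetical list-based stand-in for a length-based mask:
    # row i is True for the first lengths[i] positions, False after.
    if max_length is None:
        max_length = max(lengths)
    return [[t < n for t in range(max_length)] for n in lengths]


hop_length = 256            # hypothetical value shared by both features
wav_lengths = [2560, 1792]  # hypothetical sample counts for a batch of two

# Both features are framed with the same hop length, so the frame
# counts match one to one.
spec_lengths = [num_frames(n, hop_length) for n in wav_lengths]
ying_lengths = [num_frames(n, hop_length) for n in wav_lengths]

spec_mask = sequence_mask(spec_lengths)
ying_mask = sequence_mask(ying_lengths)
print(spec_mask == ying_mask)
```

Under these assumptions, passing the spectrogram lengths to the pitch encoder yields the same mask as passing the ying lengths. If the two features ever used different hop lengths, the lengths would differ and the ying lengths would be the right ones to pass.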
Thank you for your reply.
Thanks for the interesting paper and the nice repo.
I have a question about the pitch encoder. It takes ying, the spectrogram lengths, and the speaker embedding as inputs. That seems a bit weird to me, because the encoder builds its length-based mask with common.sequence_mask, so I think it should be given the ying lengths instead.
Should that parameter be replaced with the ying lengths? Please let me know so that I can adjust my code.