Hi all,
I am trying to get the duration of a given text (the text input that we pass to txt2vec) so that I can set the `d_control` parameter. I am reading `log_duration_prediction` from `ZMM-TTS.txt2vec.model.modules.VarianceAdaptor`, but I can't tell what unit the values are in (samples or some other format). Ultimately, I want to convert this duration to milliseconds or seconds.
If anyone has worked with these parameters, could you point me to how to convert these values to milliseconds/seconds? If that is not possible, how can I get the duration of the given text from this model in that format?
Also, is the range of `d_control` between 0 and 1?
Thanks in advance.
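
To make the question concrete, here is a minimal sketch of the kind of conversion being asked about. It assumes, without having confirmed it in the ZMM-TTS code, that the predictor follows the common FastSpeech2 convention where each value is `log(frames + 1)` for one input token. The function name and the `hop_length` and `sample_rate` values are hypothetical placeholders; the real ones would have to come from the model's preprocessing config.

```python
import math


def log_durations_to_seconds(log_durations, hop_length, sample_rate, d_control=1.0):
    """Convert per-token log-duration predictions to seconds.

    ASSUMPTION: each prediction is log(frames + 1), as in FastSpeech2-style
    variance adaptors. Not verified against ZMM-TTS.
    """
    seconds = []
    for log_d in log_durations:
        # Undo the log, round to whole frames, then scale by d_control.
        frames = max(round(math.exp(log_d) - 1.0) * d_control, 0)
        # One frame spans hop_length audio samples.
        seconds.append(frames * hop_length / sample_rate)
    return seconds


# Hypothetical predictions for three tokens lasting 5, 10 and 0 frames.
log_durations = [math.log(5 + 1), math.log(10 + 1), math.log(0 + 1)]

# hop_length=320 and sample_rate=16000 are placeholder values.
per_token = log_durations_to_seconds(log_durations, hop_length=320, sample_rate=16000)
total_ms = sum(per_token) * 1000.0
print(per_token, total_ms)
```

If that assumption holds, `d_control` would act as a plain multiplier on the predicted frame counts (1.0 leaves them unchanged, larger values lengthen the speech), so it would not be restricted to the 0 to 1 range. Confirmation of whether ZMM-TTS actually works this way would be appreciated.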