This would be amazing if it works!!!
But I am sorry, I think it is not really doing TTS. It requires the wave as input to the model which is not possible in prediction.
After running the training, I have tested the model prediction by filling y with zeros, it produces nothing. I looked at your reply issue #4, still not convinced. Could you please provide inference code to prove it?
This would be amazing if it works!!!
But I am sorry, I think it is not really doing TTS. It requires the wave as input to the model which is not possible in prediction.
After running the training, I have tested the model prediction by filling y with zeros, it produces nothing. I looked at your reply issue #4, still not convinced. Could you please provide inference code to prove it?