Open thepowerfuldeez opened 3 years ago
hi! nice work and finally non-autoregressive tts with no explicit durations as labels. Do you have any plans to release pytorch implementation?
Thanks. I'm sorry that I don't have time to reimplement it in Pytorch. I actually heavily referenced this wonderful NAR-MT model XuezheMax/flowseq, which is implemented in Pytorch, when I was doing VAENAR-TTS implementation. I think you can find most code snippets (e.g. the flow part) needed to implement a Pytorch version VAENAT-TTS.
I’ll go get it a try, thank you! Most of the modules are already implemented (eg encoder and decoder from transformer-tts or postnet from tacotron2)
Hi @thepowerfuldeez, I just published the PyTorch implementation of VAENAR-TTS, and you can check it here: https://github.com/keonlee9420/VAENAR-TTS I'm training the model and going to release pre-trained models as well. Please feel free to play with it, and I hope it helps! Any reports are appreciated.
And thanks @light1726 for the great work. It was exciting to implement the PyTorch version of VAENAR-TTS, and if you have room for it, I want to ask you about sharing my implementation in README.md. I think it's not a big deal since my work is hugely based on yours and can be seen as another implementation in another computer language. I am looking forward to your answer! Thanks.
hi! nice work and finally non-autoregressive tts with no explicit durations as labels. Do you have any plans to release pytorch implementation?