Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)
55
stars
4
forks
source link
Quickly transfer to gptsovits, the effect of this repo doesn't seem to be worth messing. #2
This is very similar to the idea and architecture of gpt-sovits, but the degree of dissemination is far from the gptsovits.