Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
MIT License
7.42k stars 747 forks source link

关于language embedding #154

Open zjwang21 opened 6 months ago

zjwang21 commented 6 months ago

论文中做法似乎是只在accoustic tokens上加language embedding,大佬这里实现为什么是加在text上呢?