Plachtaa / seed-vc

zero-shot voice conversion with in context learning
GNU General Public License v3.0
137 stars 15 forks source link

会有语音克隆的版本吗 #3

Open TheHonestBob opened 1 week ago

TheHonestBob commented 1 week ago

我非音频方向的,想问一下该架构适合语音克隆吗?或者未来会有语音克隆的版本计划吗

Plachtaa commented 1 week ago

zero shot就是可以语音克隆的

TheHonestBob commented 1 week ago

zero shot就是可以语音克隆的

我看放出的demo,是输入目标音频+prompt音频,是否能够支持text+prompt音频,目前来看如果我想实现语音克隆,貌似需要先tts,然后使用seed-vc项目

Plachtaa commented 1 week ago

voice conversion和text to speech是不同的task,zero shot voice conversion只能clone timbre

TheHonestBob commented 1 week ago

voice conversion和text to speech是不同的task,这个只能clone timbre

voice conversion和text to speech这个我明白,我的意思是voice clone,我理解的voice clone是输入text+prompt音频,然后输出prompt音频音色的text音频