怎么训练出带有情绪的声音

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Apache License 2.0

4.69k stars 705 forks source link

Open lijing0224 opened 11 months ago

lijing0224 commented 11 months ago

比如给出一段文本，用生气的语气，开心的语气读出来，应该怎么准备数据，是要每种情绪训练一个语音合成模型，然后对文本先做分类，如果是开心，就选开心对应的语音合成模型，是这样吗

AnyaCoder commented 11 months ago

这个得需要加情绪识别模型了。隔壁的Bert-VITS-2https://github.com/fishaudio/Bert-VITS2.git大概能做到这一点。

anfogy commented 11 months ago

EmotionalVITS应该也可，但没试过

mikeyang01 commented 11 months ago

比如给出一段文本，用生气的语气，开心的语气读出来，应该怎么准备数据，是要每种情绪训练一个语音合成模型，然后对文本先做分类，如果是开心，就选开心对应的语音合成模型，是这样吗

您说的方法应该可以吧, 成了告我们一声

mikeyang01 commented 11 months ago

这个得需要加情绪识别模型了。隔壁的Bert-VITS-2https://github.com/fishaudio/Bert-VITS2.git大概能做到这一点。

这个是有感情, 但是无法选择感情吧