RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
MIT License
33.45k stars 3.84k forks source link

生成字幕 #1658

Open Jin-W-FS opened 4 days ago

Jin-W-FS commented 4 days ago

生成与音频同步的字幕:

另:ref_audio_path参数可接受形如base64:xxxxxx的字符串作为base64编码的音频,免去上传音频文件这一步。