speaker - Githubissues

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

https://funaudiollm.github.io/

Apache License 2.0

2.52k stars 230 forks source link

Closed Lixi20 closed 1 week ago

Lixi20 commented 2 weeks ago

可以识别音频里面的speaker吗。有没有借口？

Lixi20 commented 2 weeks ago

可以识别音频里面的speaker吗。有没有接口？

ZhihaoDU commented 1 week ago

我想你要找的是SenseVoice或者3D-speaker。CosyVoice是生成模型，不能识别说话人。