请问可以分离出指定人的语音吗

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Apache License 2.0

1.07k stars 93 forks source link

Closed cx-cs closed 8 months ago

cx-cs commented 8 months ago

在多人说话重叠的场景是否可以分离出指定说话人的声音？

yfchenlucky commented 8 months ago

通过 https://github.com/alibaba-damo-academy/3D-Speaker/tree/main/egs/3dspeaker/speaker-diarization 代码可以将一段话中的各说话人语音分离，如果你想分离出置顶说话人声音，可以通过提取指定说话人的speaker embedding和各聚类中心比较，选择最近的类，再获取其相应语音片段。