PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
https://paddlespeech.readthedocs.io
Apache License 2.0
10.61k stars 1.81k forks source link

声音区分怎样实现那,根据声纹 #3472

Open chenkang404 opened 11 months ago

chenkang404 commented 11 months ago

首先这个问题我觉得很难用一两句说明白,所以在这里具体说明一下,希望给些建议或关键词。 问题背景:我有一段两个人的对话语音,我使用音频切分模块去进行切分,然后对每段音频进行识别并获得其声纹,根据每段话的声纹来分辨出两个人各自说的话(但这个方法是我自己创造的,即对比每一段声音音频再根据思相似度聚类)。 问题提出:根据声纹区分出两个人的对话有比较有理论支持的方法吗

zxcd commented 11 months ago

使用声纹做说话人区分(Speaker Identification) 这块成熟的方案有很多,可以看看论文什么的

stale[bot] commented 9 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.