-
首先这个问题我觉得很难用一两句说明白,所以在这里具体说明一下,希望给些建议或关键词。
问题背景:我有一段两个人的对话语音,我使用音频切分模块去进行切分,然后对每段音频进行识别并获得其声纹,根据每段话的声纹来分辨出两个人各自说的话(但这个方法是我自己创造的,即对比每一段声音音频再根据思相似度聚类)。
问题提出:根据声纹区分出两个人的对话有比较有理论支持的方法吗
-
Dataloader name: `hse_thai/hse_thai.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?hse_thai
| Dataset| hse_thai |
|-------------|---|
| Description | he HSE Thai C…
-
Thank you for this code.How can I use this code to do speaker recognition on my own dataset.
-
Hello,
I start to use transcribee to transcript some of the podlovers episodes (podlovers.org). I use a local env (Ubuntu 20.04) on a dell xps laptop. I have tried to transcribe 3 episodes. The ali…
-
(github怎么按了回车键直接就发出去了……我还没编辑完)
跑了个联动回,语音转文字后还需要逐行去标注说话人
然后翻到了一个声纹识别的包:
https://github.com/pyannote/pyannote-audio
应用项目:
https://github.com/yinruiqing/pyannote-whisper
https://github.com/lablab-a…
-
Hi,
#1 - nuget for visual studio 2019 did NOT work. Kept telling me it couldn't find it when it was clearly there. I got around that by compiling for C#sharp (the command "mcs" was incorrect. I use…
-
/usr/share/sounds/alsa/Front_Center.wav
http://doc.qt.io/qt-5/qml-qtmultimedia-audio.html
-
**[ UUID ]** ea8d85de-e620-4574-8522-9edc9083eaf2
**[ Session Name ]** The Right to Privacy in India: A Look at Digital Identification and Aadhaar
**[ Primary Space ]** Privacy and Security
**[ S…
-
**Is your feature request related to a problem? Please describe.**
It can be useful to modify audio passed to STT plugins to remove silence and normalize audio levels for better accuracy. There are a…
-