-
Hello,
Thank you for your work on WavLM.
I am trying to reproduce the results, but I am having some difficulties.
First of all, I don't understand exactly the difference between the scores displayed in differen…
-
Hi,
For speaker verification, when comparing two audio files from different speakers using their embeddings, does the speech have to be the same spoken text (i.e. is the method text-dependent)?
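To make the question concrete, here is a minimal sketch of how two speaker embeddings are commonly compared. It assumes the embeddings are plain fixed-size float vectors and that scoring is cosine similarity; the thread does not show the actual scoring code, so both are assumptions. The function names and the `threshold` value are hypothetical, chosen only for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def same_speaker(emb1, emb2, threshold=0.5):
    """Accept the pair as one speaker if the score reaches the threshold.

    The default threshold is a placeholder, not a tuned value; in practice
    it would be chosen on a development set.
    """
    return cosine_similarity(emb1, emb2) >= threshold
```

Nothing in this comparison looks at the words spoken; it only sees one vector per utterance. Whether the embeddings themselves are sensitive to the text depends on how the model was trained.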
-
python verification.py --model_name ecapa_tdnn --wav1 vox1_data/David_Faustino/hn8GyCJIfLM_0000012.wav --wav2 vox1_data/Josh_Gad/HXUqYaOwrxA_0000015.wav
2023-08-03 18:48:40 | INFO | fairseq.tasks.te…
-
Thank you for providing the eval script.
I found a hard-coded path `/mnt/bn/jdy-lq-2/s3prl_s3prl_main` in `thirdparty/UniSpeech/downstreams/speaker_verification/models/ecapa_tdnn.py`, which caused …
-
FunASR is a fundamental speech recognition toolkit that offers a variety of features, including speech recognition (ASR), Voice Activity Detection (VAD), Punctuation Restoration, Language Models, Spea…
-
Could you share the complete code?
-
According to the WavLM paper:
([WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing](https://arxiv.org/pdf/2110.13900.pdf))
They used an ECAPA-TDNN embedding model for …
-
(sd) root@novo:~/pyannote-audio-develop# python seg.py
CUDA is available.
torchvision is not available - cannot save figures
Traceback (most recent call last):
File "/root/pyannote-audio-develo…
-
Hi, why not add speaker classification to the speaker encoder, or use speaker verification features? If I only use a speaker encoder, will there be any problems with timbral coupling?
-
Is the evaluation data in the results table of the paper obtained by running main.py test after training, or by evaluating the results produced with convert? And how do I distinguish between s2s …