-
When I do:
# inference with FreeVC
`CUDA_VISIBLE_DEVICES=0 python convert.py --hpfile logs/freevc.json --ptfile checkpoints/freevc.pth --txtpath convert.txt --outdir outputs/freevc`
How do I ge…
-
Hi Wataru,
I'm now at the training stage and have a few questions to ask:
- I noticed that you set `.repeat(2)` in datamodule.py; as with config.yaml, this could be controlled according t…
-
Hi,
I have a quick question. To reproduce the same training and testing settings as the SUPERB benchmark, should we just run the commands as provided in the README? Or does changing the batch size or le…
-
Hi!
Thanks for the great repo!
I find that CCC-wav2vec 2.0 performs especially well on the SUPERB SE task, surpassing WavLM Large by a large margin.
I am trying to reproduce it but have not yet succes…
-
**Describe the bug**
ASR model has been trained and packed along with other required materials (stage 14) to be used for inference in another cluster/system. During inference "RuntimeError: Error(s) …
-
Hi,
I want to use the **wavlm** model to extract speaker embeddings for the speaker verification task. In [the paper](https://arxiv.org/pdf/2110.13900.pdf) it is mentioned that for the task of speaker verificat…
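For the scoring side of verification, a minimal sketch of cosine scoring between two speaker embeddings (assuming you already have fixed-dimensional embeddings, e.g. from a WavLM-based x-vector head; the extraction step itself is not shown, and the 512-dim vectors below are placeholders):

```python
import numpy as np

def cosine_score(emb_a: np.ndarray, emb_b: np.ndarray) -> float:
    """Cosine similarity between two speaker embeddings."""
    a = emb_a / np.linalg.norm(emb_a)
    b = emb_b / np.linalg.norm(emb_b)
    return float(np.dot(a, b))

# Hypothetical 512-dim embeddings; real ones would come from the model.
rng = np.random.default_rng(0)
e1 = rng.standard_normal(512)
e2 = e1 + 0.1 * rng.standard_normal(512)  # same speaker, slight perturbation
e3 = rng.standard_normal(512)             # different speaker

print(cosine_score(e1, e2))  # close to 1.0
print(cosine_score(e1, e3))  # near 0.0
```

In practice the accept/reject decision is a threshold on this score, tuned on a development trial list.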
-
Regarding the dimensions of the features extracted by DiffuseStyleGesture+: why is the audio feature dimension set as 40+64+2+2+1024+1?
Why is the MFCC 40, the log-mel 64, the prosody features 4, and so on? Is there a particular reason behind these settings, and why were the feature dimensions chosen this way?
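For reference, the expression 40+64+2+2+1024+1 sums to 1133 per frame. A minimal sketch of such a concatenation, where only the sizes are taken from the expression above and the component names are guesses (what each block actually is, is exactly what the question asks):

```python
import numpy as np

T = 10  # hypothetical number of audio frames

# Per-frame blocks; sizes from 40+64+2+2+1024+1, names are assumptions.
mfcc     = np.zeros((T, 40))    # MFCC coefficients
logmel   = np.zeros((T, 64))    # log-mel spectrogram bins
pros_a   = np.zeros((T, 2))     # prosody block (2 dims)
pros_b   = np.zeros((T, 2))     # prosody block (2 dims)
ssl_feat = np.zeros((T, 1024))  # e.g. a WavLM-Large hidden state is 1024-dim
extra    = np.zeros((T, 1))     # remaining scalar feature

audio_feat = np.concatenate(
    [mfcc, logmel, pros_a, pros_b, ssl_feat, extra], axis=-1
)
print(audio_feat.shape)  # (10, 1133)
```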
-
As the title says.
-
Hi
Is there any way to detect the emotion or stress of a speaker within the Whisper domain?
Best
-
I found your paper here: https://arxiv.org/pdf/2304.13085
Thanks for your contribution!
Could you provide a more detailed description or sample source code for WavLM as used in your paper?
I have tried to…