-
### System Info
I am trying to run this: **bash decode_wavlm_large_linear_vicuna_7b.sh**
But I am not sure what to pass for ckpt_path, since I currently do not have model.pt. Where do I get this…
-
```
> Downloading WavLM model to E:\workspace\clone-voice\tts\wavlm\WavLM-Large.pt ...
Traceback (most recent call last):
File "D:\miniconda3\lib\urllib\request.py", line 1348, in do_open
h…
```
-
Thanks for your work. In the paper, you mention that the teacher model is WavLM + ECAPA-TDNN. However, in your implementation, I only found that you load the WavLM weights (line 13 in train/experime…
-
Hi team,
Thank you very much for releasing this model!
I'm curious about training/inference with WavLM to improve performance.
Running inference with WavLM-Base throws this error:
```
% …
```
-
Hi, thanks for sharing the pre-trained models. However, I have run into some problems:
I followed the sample code on this page: https://github.com/microsoft/unilm/tree/master/wavlm, but I got abnormal la…
-
### Describe the bug
Hi,
I am trying to extract tokens using the modules `speechbrain.lobes.models.huggingface_transformers.discrete_wavlm` and `speechbrain.lobes.models.huggingface_transf…
-
According to the WavLM paper:
([WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing](https://arxiv.org/pdf/2110.13900.pdf))
They used an ECAPA-TDNN embedding model for …
-
Hello, thanks for sharing the code!
In DiffuseStyleGesture, the model uses only one audio feature, WavLM.
But when extracting the WavLM feature from the raw wav, the [code](https://github.com/YoungSeng/Di…
-
I preprocessed around 65 GB of speech embeddings for the VCTK data. Do you use the [k-means implementation](https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html) from sklearn? What hy…
-
Hello, I want to implement flash attention for WavLM, which uses relative positions. I saw an issue where somebody said it is not supported yet, so my question is the same: is it now supported, or i…