-
```log
03-17 22:03:47 | INFO | subprocess.py:23 | Running: train_ms_jp_extra.py --config Data\Nene\config.json --model Data\Nene
03-17 22:03:51 | INFO | train_ms_jp_extra.py:110 | Loading config…
-
I think there may be two problems.
First when I extract wavlm audio features by runnning ```setup_cmumosei.sh```, I'm able to get 25 layers result which may include model input embedding. But other …
-
@ilucasgoncalves I was looking at the code wanted to know what GPU memory size was used during training for categorical SER. It looks like a single GPU was used, and I am assuming a GPU with over 20GB…
-
### Tested versions
3.1.1
### System information
ubuntu 20.04, 2xGPU A100
### Issue description
Hello Hervé,
I am having issues with multi-GPU training that I am not sure how to solve. I woul…
-
Hi, thanks a lot for sharing your work. But I have met a problem about extracting wavlm feature.
I tried to extract query_feats, matching_set, but features of the last 2 layers (23, 24) are always Na…
-
Hey, this was a fantastic repo I found in my research from the last few weeks I am trying to understand some code things from your repo is it possible for you to solve my issue below written
1. The…
-
```
C:\WorkSpace\Style-Bert-VITS2\sbv2\Style-Bert-VITS2\venv\lib\site-packages\pyannote\audio\core\io.py:43: UserWarning: torchaudio._backend.set_audio_backend has been deprecated. With dispatcher …
-
The extracted wavLM features have different dimensions for each input file. The shape of numpy feature vector of a input file is (67,1024). Does this mean there are 67 feature vectors with each of 102…
-
执行app.py的代码如下:
![baocuo1](https://github.com/jianchang512/clone-voice/assets/37930393/9765cd5e-1e28-4141-981e-9a90c1b9c451)
其中有一行为
```shell
> Downloading WavLM model to /Users/xcj/Desktop/clone-vo…
-
Hello, I just discovered onnx format and its advantages in speed.
Has anyone tried to export MeloTTS to onnx format?