-
在执行训练命令时报错了。
命令:!python3.8 finetune_speaker_v2.py -m "./OUTPUT_MODEL" --max_epochs 1000 --drop_speaker_embed True
报错日志:
`INFO:OUTPUT_MODEL:{'train': {'log_interval': 10, 'eval_interval': 100, 'se…
-
WavLMのところでバッチサイズ1のときのshapeがうまくあっていない様子
```
RuntimeError: Given groups=1, weight of size [512, 1, 10], expected input[1, 5945, 1] to have 1 channels, but got 5945 channels instead
```
修正中です
-
### Describe the bug
when trying to finetune WavLM and using DDP. there are some unused parameters. This causes the run to crash. when using --find_unused_parameters it says there are no unused par…
-
We are currently working on identifying the backend versions with which we are compatible and with which we want to be compatible. These backends are PyTorch and TensorFlow. We will be considering Fla…
-
Please check whether this paper is about 'Voice Conversion' or not.
## article info.
- title: **Audio Deepfake Detection with Self-Supervised WavLM and Multi-Fusion Attentive Classifier**
- summary: …
-
model | EER(mine) | EER(official)
-- | -- | --
wavlm_large_nofinetune.pth | 0.965 | 0.75
wavlm_large_finetune.pth | 0.631 | 0.431
The above result…
-
Using noise scaled MAS for VITS2
Using duration discriminator for VITS2
INFO:models:Loaded checkpoint 'Data\abc\models\DUR_0.pth' (iteration 0)
ERROR:models:emb_g.weight is not in the checkpoint
I…
-
跟 worker 不知道有关系么 , batch 设置为1 就会报这个错误 ,batch 高 ,占用现存太大
Traceback (most recent call last):
File "train_ms.py", line 840, in
run()
File "train_ms.py", line 361, in run
trai…
-
Hi, thanks for your great work! I would like to use VSim for speaker similarity evaluation. From the document, I see that I should use "wavlm_large_fintune.pth" model when "model_type=valle". I'm not …
-
While executing the stage2 training I am getting cuda out of memory error continuously. I am executing stage2 training code in NVIDIA L40S GPU.
File "train_second.py", line 827, in
main(…