-
Hi, I am trying to finetune FreeVC-s model with a small dataset formatted in vctk format. I have done necessary changes in config file. I am also not using SR-based augmentation. But when I run train.…
-
Trying to compare it with https://huggingface.co/nvidia/speakerverification_en_titanet_large
-
안녕하세요.
굉장한 프로젝트를 오픈소스로 올려주셔서 감사합니다.
VC과 TTS를 융합하기 위해서 w2v를 뽑아내는 컨셉이 정말 재밌습니다.
흥미가 생겨서 학습을 시켜보고 있던 중에
TTV transformer 관련 질문이 생겨 이렇게 issue 올립니다.
posterior => decode 는 잘 되는 것을 확인했습니다만,
prior와 p…
-
First of all, Thanks for providing this wonderful model.
Whenever I try to generate a config, it wouldn't add a spk2id no matter what (and it won't start training without one, naturally). I tried …
-
@ductuantruong
Thank you for sharing your excellent research.
I would like to inquire about training config options.
It is known that applying speech perturbation in speaker verification per…
-
I finetune the [model](https://huggingface.co/yl4579/StyleTTS2-LibriTTS) you posted in the public domain, which was trained on libritts, on my data. With the default config config_ft.conf ,
`
log_di…
-
Hello, I wanted to train WavLM on a custom dataset of mine that has 44.1khz sample rate to use it as an encoder in an audio project, because the WavLM-Large and the rest of the pretrained models are t…
-
I don’t know why this error is reported. Here is my entire error display and the contents of the configuration file.
Traceback (most recent call last):
File "train_finetune.py", line 709, in
…
-
Just wondering because this project seems great but 16000hz is a bit too low frequency for my needs.
-
**Describe your question**
Hi developers,
I was trying to run this experiment: https://github.com/espnet/espnet/tree/master/egs2/librispeech/asr2#expasr_train_discrete_asr_e_branchformer1_raw_wavl…
Slyne updated
7 months ago