-
Hi , I see many vocoder implementation in your project.
I want to train a vocoder on **multi-speaker dataset,** which vocoder can be used for it?
And the mel-files (containing **audio.npy and mel…
-
Hi @wookladin ,
I was trying to fine-tune HIFI-GAN for a single speaker dataset(20 mins of Audio) and the training time per epoch was around 35 seconds. This seems too long. Any ideas of how to mak…
-
Is there any way to currently inference the model and create an output?
-
## 2021/08/05
- `v1`
- Follow official setting
- Mel range in mel loss is different (full vs. 80-7600)
- Log base in mel loss is different (ln vs. log10)
- `v2`
- Use STFT loss instead o…
-
**Is your feature request related to a problem? Please describe.**
I am trying out text-to-speech pipeline and I pushed files to hub based similar to [this one](https://huggingface.co/facebook/tts_tr…
-
Thank you for you great job and sharing. I am a beginer in svs. I have two questions:
1. mel-feature-extract:
For the MLP-based acoustic model training, "data/dsp/core.py" is used for extract mel…
-
-
is it possible to use hifi gan vocoder on sv2tts?
-
Does VocGAN produce higher quality audio than fre-gan or hifi-gan?
Coice updated
3 years ago
-
Hello!
You seem to have done quite a bit of vocoder comparisons. I have two questions based on your own personal experience.
- Which vocoder do you feel has the best overall quality (ignoring in…
Coice updated
2 years ago