-
Hi,
I have been trying to listen to my French model, but I still cannot get good output when running inference with Tacotron 2 and HiFi-GAN; I get gibberish instead.
When I run inference, the fol…
-
Speech conversion performs well, but singing conversion is not ideal. For singing voice conversion, could you show me how to use HiFi-GAN? HiFi-GAN also has a pretrained model with the s…
-
Hi, I learned about your work through this discussion thread in Coqui TTS: https://github.com/coqui-ai/TTS/discussions/930 and found your dataset included in TTS/recipes as well. I am interested i…
-
I saw in the README that HiFi-GAN inference is 10x faster than Multi-band MelGAN inference. Why is HiFi-GAN so fast?
-
Hi, glow-tts is really wonderful work! I noticed your update in README.md:
> 1) moving to a vocoder, HiFi-GAN to reduce noise, 2) putting a blank token between any two input tokens to impro…
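For readers unfamiliar with the second item, here is a minimal sketch of what "putting a blank token between any two input tokens" could look like. The function name `intersperse_blank` and the blank ID `0` are assumptions made for illustration, not taken from the glow-tts code; an actual implementation may also pad the start and end of the sequence.

```python
def intersperse_blank(tokens, blank_id=0):
    """Return a new list with blank_id placed between every pair of tokens.

    Hypothetical helper: the name and the default blank_id are assumptions.
    """
    if not tokens:
        return []
    # A sequence of n tokens has n - 1 gaps, so the result has 2n - 1 slots.
    out = [blank_id] * (2 * len(tokens) - 1)
    # Even positions hold the original tokens; odd positions stay blank.
    out[0::2] = tokens
    return out


token_ids = [5, 6, 7]
print(intersperse_blank(token_ids))
```

The sequence length roughly doubles, which is worth keeping in mind when comparing alignments or durations across models trained with and without the blank token.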
-
tl;dr: I suspect the input audio is being resampled without a lowpass filter, or otherwise improperly, before it is passed to hifi-gan-bwe in the Hugging Face Space demo. I'm uncertain whether the issue exists in r…
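To illustrate the kind of artifact being suspected here, the following is a small standard-library sketch of downsampling with and without a lowpass filter. The sample rates, the 15 kHz test tone, and all function names are assumptions chosen for the demonstration; nothing here is taken from the hifi-gan-bwe code or the demo. A 15 kHz tone at 48 kHz, naively decimated by 3, folds down to a full-strength 1 kHz tone at 16 kHz, while filtering first removes most of it.

```python
import math


def tone(freq_hz, sample_rate, n):
    """Generate n samples of a unit-amplitude sine wave."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


def windowed_sinc_lowpass(cutoff_hz, sample_rate, num_taps=101):
    """Hamming-windowed sinc FIR lowpass, normalized to unity gain at DC."""
    fc = cutoff_hz / sample_rate  # cutoff in cycles per sample
    mid = (num_taps - 1) / 2
    taps = []
    for k in range(num_taps):
        x = k - mid
        sinc = 2 * fc if x == 0 else math.sin(2 * math.pi * fc * x) / (math.pi * x)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * k / (num_taps - 1))
        taps.append(sinc * window)
    total = sum(taps)
    return [t / total for t in taps]


def fir_filter(signal, taps):
    """'Valid' convolution: keep only outputs where the filter fully overlaps."""
    n = len(taps)
    return [
        sum(taps[j] * signal[i + n - 1 - j] for j in range(n))
        for i in range(len(signal) - n + 1)
    ]


def rms(signal):
    return math.sqrt(sum(s * s for s in signal) / len(signal))


def decimate_naive(signal, factor):
    """Keep every factor-th sample with no filtering (aliasing-prone)."""
    return signal[::factor]


def decimate_filtered(signal, factor, sample_rate):
    """Lowpass below the new Nyquist frequency, then keep every factor-th sample."""
    cutoff_hz = 0.45 * sample_rate / factor
    taps = windowed_sinc_lowpass(cutoff_hz, sample_rate)
    return fir_filter(signal, taps)[::factor]


SRC_RATE = 48000  # assumed source rate
FACTOR = 3        # 48 kHz -> 16 kHz
x = tone(15000, SRC_RATE, 4800)

naive_rms = rms(decimate_naive(x, FACTOR))
filtered_rms = rms(decimate_filtered(x, FACTOR, SRC_RATE))
print(f"RMS after naive decimation:    {naive_rms:.4f}")
print(f"RMS after filtered decimation: {filtered_rms:.4f}")
```

If a pipeline behaves like `decimate_naive`, high-frequency content in the input reappears as spurious low-frequency energy, which a bandwidth-extension model would then treat as real signal.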
-
Hi,
I have a 15-minute recording of a new speaker. I'd like to train assem-vc to perform any-to-one voice conversion. Based on my previous experience, the best and fastest way to do it would be to creat…
-
Hello! I'm trying to fine-tune HiFi-GAN according to your fastpitch_finetuning tutorial. I generated manifests in the same format as in the example. Then I launched the fine-tuning script by simply ch…
-
It seems the paper uses serial ResBlocks (kernel sizes 3, 7, 11), but the code uses parallel ResBlocks.
![image](https://user-images.githubusercontent.com/23380152/160524678-e396c18b-e2cf-4e12-894a-15e1ad39…
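To make the distinction in this question concrete, here is a toy sketch of the two arrangements. It uses a moving average as a stand-in for a convolution, and the function names are hypothetical; it is not the HiFi-GAN generator code, only an illustration of "serial" (each block feeds the next) versus "parallel" (every block sees the same input and the outputs are averaged) with the kernel sizes mentioned above.

```python
KERNEL_SIZES = (3, 7, 11)


def smooth(signal, kernel_size):
    """Toy stand-in for a conv layer: same-length moving average."""
    half = kernel_size // 2
    out = []
    for i in range(len(signal)):
        window = signal[max(0, i - half): i + half + 1]
        out.append(sum(window) / len(window))
    return out


def res_block(signal, kernel_size):
    """Residual block: the input plus its transformed version."""
    return [x + y for x, y in zip(signal, smooth(signal, kernel_size))]


def serial_blocks(signal):
    """Serial: the output of one block is the input of the next."""
    for k in KERNEL_SIZES:
        signal = res_block(signal, k)
    return signal


def parallel_blocks(signal):
    """Parallel: every block gets the same input; outputs are averaged."""
    outputs = [res_block(signal, k) for k in KERNEL_SIZES]
    return [sum(vals) / len(outputs) for vals in zip(*outputs)]


x = [1.0] * 20
print("serial:  ", serial_blocks(x)[:3])
print("parallel:", parallel_blocks(x)[:3])
```

The two arrangements produce different outputs for the same blocks, so a checkpoint trained with one layout would not be interchangeable with a model built using the other.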
-
Hey there, I checked this out, but when I went to run `process.py`, my directory tree looked different than yours for VCTK. I downloaded it from [here](https://datashare.ed.ac.uk/handle/10283/3443) - …