-
Are NoLACE and DTX supposed to be usable together? I have observed that when turning on NoLACE, as soon as the stream switches to DTX mode, the decoded 0/1-byte packets start generating some noise whi…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
Hi
I'm trying to fine tune the cyclevae model with my own data and I'm using the pretrained model of the compatible demo. I've done all the stages of preprocessing (0init123) and I don't need to do…
-
Have you tried changing the vocoder from Waveglow to HiFi-GAN? HiFi-GAN is faster and requires less VRAM. Alternatively, you could try adding a different vocoder.
-
when I'm trying to run voicefixer, I'm getting these messages:
Downloading the weight of neural vocoder: TFGAN
Traceback (most recent call last):
File "/Library/Frameworks/Python.framework/Versio…
-
I've trained WaveRNN on LJSpeech dataset with mel as condition. When generating waves, there are some bad cases occasionally shown in the following pictures.(They are the same sentence generated at di…
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…
-
- Abstract
This talk is about how audio and speech synthesis differs, how it has evolved from the last couple of years with the deep learning techniques. I will be going through both statistical and …
-
Running through your pre-trained models, I found that generated audio does not exactly match the input in duration length. For example,
```
wav, sr = load_wav(os.path.join(a.input_wavs_dir, filname…
-
are there any detailed informations to all the parameters in the config files and how they affect the audio?
```
conf/mlfb_vqvae.yml
cobf/mflb_vqvae.yml
```
I left it all on default and trained 2…