-
-
### Model description
Vocos is a Fourier-based neural vocoder for audio synthesis.
According to its [paper](https://arxiv.org/pdf/2306.00814.pdf), Vocos constantly outperforms [HifiGan](https://hu…
-
## 🚀 Feature
Add support for 8bit conv_transpose inference operators in the form of torch.nn.quantized.functional.conv_transpose1d/2d/3d.
## Motivation
I haven't seen this mentioned elsewhere …
-
I've trained WaveRNN on LJSpeech dataset with mel as condition. When generating waves, there are some bad cases occasionally shown in the following pictures.(They are the same sentence generated at di…
-
Hi, I found that the sampling rate when training the Glow_TTS model is set to 24000 (in pretrained checkpoint), while the pretrained checkpoint in the given vocoder repo (https://github.com/CODEJIN/PW…
-
I used AhoCoder Vocoder instead of WORLD for generating wav. To adapt ahocoder with merlin, I set the BAP to 1, MGC to 40 and LF0 to 1. But acoustic model is converging to under-trained parameters.
…
-
Hi, thanks for sharing the code. I have tried it on different datasets including Chinese and English. However, there is some clipping on some of the generated waveforms (like the generated Mel spectru…
-
as mentioned in paper, will you provide pretrained weight of model?
also, reconstruction from encodec tokens using [vocos](https://github.com/charactr-platform/vocos) may boost quality of audio res…
-
```
read_wav_from_disk: Number of frames read = 1577459.
ggml_new_object: not enough space in the context's memory pool (needed 39200416, available 39200096)
compress: /mnt/c/prog/fork/encodec.cpp/…
-
are there any detailed informations to all the parameters in the config files and how they affect the audio?
```
conf/mlfb_vqvae.yml
cobf/mflb_vqvae.yml
```
I left it all on default and trained 2…