-
Hi, thanks for sharing the code, it is well appreciated. Some questions:
- Do you train with mean-var normalization? If not, what is the range normalization?
- I tried to plug in the models using …
-
i removed the postnet(remove the code of model and loss about postnet ) and set the pitch_quantization="log",set features of pitch and enery = "frame_level", normalization="False",and other configurat…
-
[HiFiGan](https://github.com/jik876/hifi-gan) has sota results in wav generation from mel spectrograms
Is it possibile to add support to `hifigan` model, after the `mel` generation, in order to…
-
(env) (base) C:\Users\prost\Wav2Lip>python inference.py --checkpoint_path checkpoints/wav2lip_gan.pth --face joseph.mp4 --audio josephvoice.mp3
Using cpu for inference.
Reading video frames...
Numb…
-
Got this with your trained model:
```
Traceback (most recent call last):
File "inference.py", line 60, in
main(args)
File "inference.py", line 23, in main
model.load_state_dict(che…
ghost updated
3 years ago
-
I'm interested in training to convert 24 kHz mel spectrograms to 48 kHz waveforms (like HIFI-GAN2). Might not work without changing the architecture, but that's ok. How would you modify the config fil…
-
With the rise of fast vector databases for doing approximate nearest neighbors (FLANN, annoy, chroma, milvius, weaviate, etc.), it becomes increasingly useful to have vectorial representations of audi…
-
Hi, thanks for your nice jobs. I used your codes for ny own datasets and the synthesized voices seems not that normal at 160K steps now. Though we could still figure out what's being saied, the spect…
-
@bigpon Hi
I'm trying to reproduce the denoising code.
https://github.com/facebookresearch/AudioDec?tab=readme-ov-file#bonus-track-denoising
You mentioned following the requirements in `submit_den…
-
Hi, thank you for sharing your excellent work.
I want to ask about your end-to-end TTS model. In the paper, you stated that only the decoder is changed such that it can generate waveform (by using Wa…