mel-gan Search Results - Githubissues

348 results
for mel-gan

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

jik876/hifi-gan #4

Some questions

Hi, thanks for sharing the code, it is well appreciated. Some questions: - Do you train with mean-var normalization? If not, what is the range normalization? - I tried to plug in the models using …

george-roussos updated 2 years ago
41
ming024/FastSpeech2 #193

model cantnot fit to data, and test voice is too bad when i …

i removed the postnet(remove the code of model and loss about postnet ) and set the pitch_quantization="log",set features of pitch and enery = "frame_level", normalization="False",and other configurat…

hhm853610070 updated 1 year ago
3
ming024/FastSpeech2 #23

Support to HiFiGan

[HiFiGan](https://github.com/jik876/hifi-gan) has sota results in wav generation from mel spectrograms Is it possibile to add support to `hifigan` model, after the `mel` generation, in order to…

loretoparisi updated 1 year ago
10
Rudrabha/Wav2Lip #683

wav2lip [mp3 @ 000002a6d5b45d80] Estimating duration from bi…

(env) (base) C:\Users\prost\Wav2Lip>python inference.py --checkpoint_path checkpoints/wav2lip_gan.pth --face joseph.mp4 --audio josephvoice.mp3 Using cpu for inference. Reading video frames... Numb…

maic2209 updated 5 months ago
2
rishikksh20/HiFi-GAN #4

Load pre-trained model fails

Got this with your trained model: ``` Traceback (most recent call last): File "inference.py", line 60, in main(args) File "inference.py", line 23, in main model.load_state_dict(che…

ghost updated 3 years ago
5
kan-bayashi/ParallelWaveGAN #397

How would you train for BW extension?

I'm interested in training to convert 24 kHz mel spectrograms to 48 kHz waveforms (like HIFI-GAN2). Might not work without changing the architecture, but that's ok. How would you modify the config fil…

kelseyjd updated 1 year ago
2
csteinmetz1/auraloss #66

Vectorial representation, particular of multiscale STFT

With the rise of fast vector databases for doing approximate nearest neighbors (FLANN, annoy, chroma, milvius, weaviate, etc.), it becomes increasingly useful to have vectorial representations of audi…

turian updated 11 months ago
2
keonlee9420/Comprehensive-E2E-TTS #3

severe metallic sound

Hi, thanks for your nice jobs. I used your codes for ny own datasets and the synthesized voices seems not that normal at 160K steps now. Though we could still figure out what's being saied, the spect…

GuangChen2016 updated 2 years ago
9
facebookresearch/AudioDec #35

How to execute denoising?

@bigpon Hi I'm trying to reproduce the denoising code. https://github.com/facebookresearch/AudioDec?tab=readme-ov-file#bonus-track-denoising You mentioned following the requirements in `submit_den…

a897456 updated 1 week ago
18
huawei-noah/Speech-Backbones #12

About end2end implementation

Hi, thank you for sharing your excellent work. I want to ask about your end-to-end TTS model. In the paper, you stated that only the decoder is changed such that it can generate waveform (by using Wa…

quangnh-2761 updated 2 years ago
10

上一页 1...1 2 3 4 5 6 7...35 下一页

348 results for mel-gan

348 results
for mel-gan