-
I don't know if kira has this capability, but when I call set_playback_rate on my sound, I can speed up the sound, but the pitch change makes it sound strange. I want to be able to play the sound at 2…
-
Hi, just saw your repo, and bit confused regarding the architecture and philosophy behind you TTS model. Could please add little bit regarding your architecture, like you training LLM for TTS but you …
-
[HiFiGan](https://github.com/jik876/hifi-gan) has sota results in wav generation from mel spectrograms
Is it possibile to add support to `hifigan` model, after the `mel` generation, in order to…
-
Hey there, I'm currently working on Polish model based on keithito implementation.
I've succesfully frozen graph for inference. I'm grabbing mel's from BiasAdd:0 and then I've implemented Griffin-Lim…
-
hi, I initial text2speech using my own am_model and vocoder and export onnx model, but sound quality drops significantly, I just modify hifigan inference code in [https://github.com/Masao-Someki/espne…
-
on windows, anaconda
-
Hi,
Did you not use sequence level knowledge distillation for fastpeech training??
-
When I go to synthesize and submit, this is the error message that appears: Text: Expecting value: line 1 column 1 (char 0)
Full: Traceback (most recent call last): File "flask\app.py", line 1950, in…
-
-
Any plans on implementing Mel Cepstral Distortion scores for LJSpeech? I found some useful repositories:
[SamuelBroughton](https://github.com/SamuelBroughton/Mel-Cepstral-Distortion)
[MattShannon](h…