-
**Problem statement**
Previous TTS models often produced robotic-sounding speech, mispronounced words, lacked emotional nuance, struggled with contextual understanding, offered limited language suppo…
-
I realise that when I remove adversarial loss and feature match loss, it still works well and has no degeneration of performance. This makes me question the role of adversarial training in reduction o…
-
Hi
I was using the below arguments in the normal TTS can you add these to the do_tts? I am getting syntax error when I try do it my self, am a bit of rookie so am not sure why.
` parser = argpar…
-
When I run the sample script I keep getting this error message among others...not sure how dire it is or whether it even impacts performance...
```
You set `add_prefix_space`. The tokenizer needs …
-
Hello authors,
First of all, thank you for giving us an impressive repository.
For now, I want to re-trained your model with Korean language, for example KSS (korean single speaker). However, when …
-
![QQ截图20240310024054](https://github.com/rsxdalv/tts-generation-webui/assets/83493228/076ca128-2f13-41e0-8033-74064c7cdb88)
Translate other languages via audio files like this!
curui updated
8 months ago
-
What kinds of algorithms have you used to segment such long audios? The forced aligner could have some limitation to segment a long audio at once.
-
Using the methods of [XProcMessageListener](http://grepcode.com/file/repo1.maven.org/maven2/com.xmlcalabash/xmlcalabash/1.0.13-94/com/xmlcalabash/core/XProcMessageListener.java), error codes can be as…
-
Hey man, amazing work!
Do you have plans to release the code?
-
As my experiment , the result of glow-tts sounds more like robot than a real person, do you have any method could make it more nature, like the result or autoregressive model, like tacotron eg..thanks…