-
1. Download and unpack the LJ Speech dataset from https://keithito.com/LJ-Speech-Dataset/ (a large single-speaker speech dataset that includes the transcripts).
2. Install RVC WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI).
…
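Once the dataset is unpacked, the transcripts can be paired with the audio clips. The following is a minimal sketch, assuming the standard LJ Speech layout (a pipe-separated `metadata.csv` next to a `wavs/` folder, with columns for clip ID, raw transcript, and normalized transcript). The function name `load_ljspeech_metadata` is hypothetical and is not part of RVC WebUI or any other tool mentioned here.

```python
import csv
from pathlib import Path
from typing import List, Tuple


def load_ljspeech_metadata(dataset_dir: str) -> List[Tuple[Path, str]]:
    """Return (wav_path, transcript) pairs from an unpacked LJ Speech folder.

    Assumes the layout <dataset_dir>/metadata.csv and <dataset_dir>/wavs/<id>.wav.
    """
    root = Path(dataset_dir)
    pairs = []
    with open(root / "metadata.csv", encoding="utf-8", newline="") as f:
        # Fields are separated by "|". Quoting is disabled because the
        # transcripts contain literal quote characters.
        reader = csv.reader(f, delimiter="|", quoting=csv.QUOTE_NONE)
        for row in reader:
            if len(row) < 2:
                continue  # skip blank or malformed lines
            clip_id = row[0]
            # Prefer the normalized transcript (third column) when present.
            text = row[2] if len(row) > 2 else row[1]
            pairs.append((root / "wavs" / f"{clip_id}.wav", text))
    return pairs
```

Calling `load_ljspeech_metadata("LJSpeech-1.1")` (the folder name here is an assumption about where the archive was unpacked) returns a list that can be iterated to inspect or filter clips before training.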
-
As I understand it, this TTS algorithm works with your audio files without assigned text.
1. How does it understand the content and the language?
2. Does it work with the LJ Speech set only or a datase…
-
Hello there, devs of StyleTTS 2. It's a great model; you really did a good job.
I mainly use it on the Hugging Face demo, but there are some issues:
Firstly, it pauses after the dash (-) symbol, so please fix i…
-
Thanks for sharing the repo. I have trained the model using this repo on LJ Speech. I am performing inference using only GST. During inference I use an out-of-dataset file as the style file. The synthesiz…
-
After 140k I'm stuck around 0.08.
-
I used the pretrained model provided in the Google Drive of the official repo. When I ran the inference.py file with the pretrained model's checkpoint, the generated samples qua…
-
I have done a lot of training on different self-made datasets (typically having around 3 hours of audio across a few thousand .wav files, all 22050 Hz) using Tacotron, starting from the pretrained LJS…
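Since the setup described above depends on every clip being 22050 Hz, it can be worth confirming that before training. The following is a minimal sketch using only Python's standard `wave` module, which reads uncompressed PCM `.wav` files. The function name `find_sample_rate_mismatches` and the flat-folder layout are assumptions for illustration, not part of any Tacotron repository.

```python
import wave
from pathlib import Path
from typing import Dict


def find_sample_rate_mismatches(folder: str, expected_hz: int = 22050) -> Dict[str, int]:
    """Map file name -> actual sample rate for every .wav that is not expected_hz."""
    mismatches = {}
    for wav_path in sorted(Path(folder).glob("*.wav")):
        # wave.open reads only the header here; no audio data is decoded.
        with wave.open(str(wav_path), "rb") as wav_file:
            rate = wav_file.getframerate()
        if rate != expected_hz:
            mismatches[wav_path.name] = rate
    return mismatches
```

An empty result means every clip in the folder matches the expected rate; any entries point to files that would need resampling first.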
-
First of all, great work! Thanks for sharing the repo!
I have trained the autoregressive model on the LJ Speech dataset. The output is quite good for short sentences. I am looking for advice on how to manipulate pauses b…
-
I have done a lot of training on different self-made datasets (typically having around 3 hours of audio across a few thousand .wav files, all 22050 Hz) using Tacotron, starting from a pretrained LJSpe…
-
In the paper you describe a test on the LJ Speech dataset (Section 4.3, Content replacement). Could you provide code for loading voice files instead of pure sample generation in tts.py?