-
Hi, thanks for your nice work.
Could you provide the script for extracting the acoustic feature for the TTA task?
The beginner recipe said there are four stages for this task.
- Data preparati…
-
`everyvoice synthesize --help`'s section called `Synthesize Help` talks about `text-to-spec` but there is no such command unless it is supposed to be referring to `text-to-wav`.
We should validate …
-
Thanks for very cool project.
This is the best and simple LLM-based TTS Implementation I have ever seen!
For audio quality, I highly recommend adding MS-STFT Discriminator of Encodec, and MS-SB…
-
when running "Introduction to End-To-End Automatic Speech Recognition" on google colab I run into:
```
import nemo
import nemo.collections.asr as nemo_asr
ModuleNotFoundError: No module n…
-
First of all thanks for the great model! I tested it extensively by now and ran across a few problems and performance issues which you might can help with.
1. Matching takes a lot of time with big …
-
Are the pre trained files for Hifi GAN correct? I was trying to fine tune them using ESD and after creating the forward outputs I tried to run the fine tune command
```
cp do_00152000 fine_tuned_ch…
-
This is a really good project. I was wondering if WavLM is supported in the project, I wanted to run a voice conversation model in the browser, also if Hifi-gan for voice synthesis.
-
I noticed that you used MelGAN in this paper. Have you also tried using HiFi-GAN?
-
### how did you install flake8?
```console
$ pip install flake8
```
### unmodified output of `flake8 --bug-report`
```json
{
"platform": {
"python_implementation": "CPython",
"python_…
-
I've attempted to preprocess my dataset to meet the mel-spectrogram requirements but I either wind up with incorrectly packed spectrogram files, a wrong header, or wrong data. Don't think any of the t…