-
Hi,
Given a target speaker dataset what is roughly the number of fine tuning steps that should be undergone ?
[NeMo](https://github.com/NVIDIA/NeMo/blob/main/tutorials/tts/FastPitch_Finetuning.i…
-
**Describe the bug**
I tried to export FastPitch model to PyTorch but failed with a following error. It happens with all EN models `tts_en_fastpitch_ipa`, `tts_en_fastpitch`, and `tts_en_fastpitch…
-
First of all thanks for very nice TTS system. This is very interresting and inspiring system.
1. I tried to train it, but it seems to train very slowly I see 0.5 to 1.6 iterations per second. At …
Whyki updated
5 months ago
-
I try to generate the speech using previously trained FastPitch model. But I have an issue with loading a FastPitch model.
What could be a reason for this kind of error?
```
DLL 2024-04-10 17:00:…
-
```
def text_to_speech(text, tts_model_name, vocoder_model_name, max_length=100, file_name_prefix=""):
torch.set_grad_enabled(False)
sampling_rates = {
"tts_en_tacotron2": 22050,…
-
I want to use python backend with triton to deploy TTS model using hifigan and fastpitch. When I infer hifigan, I meet the error
tritonclient.utils.InferenceServerException: [400] Failed to process…
-
**Describe the bug**
I follow the sample code in [NGC](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/tts_zh_fastpitch_hifigan_sfspeech) to generate chinese TTS audio
but get a bed r…
-
### Describe the bug
While using the Silero TTS extension, I encountered an error when providing long text inputs. The model seems to have a limitation on the length of the input text it can handle.
…
-
Hardware - GPU (T4)
Hardware - CPU
Operating System - ubuntu 20.04 running on AWS EC2 g4dn.2xlarge instance
I am currently trying to convert a model (several of different types but for now not ev…
-
These are the main dev plans for :frog: TTS.
If you want to contribute to :frog: TTS and don't know where to start you can pick one here and start with our [Contribution Guideline](https://github.…