-
```
root@nvidia2:/data/so-vits-svc# svc train
[14:48:51] INFO [14:48:51] Created a temporary directory at /tmp/tmpogn07duj …
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…
-
Great work! This singing synthesis effect is the best I have heard so far, and the paper is also very scientific. Will you release the code?
-
### Model/Pipeline/Scheduler description
TorToise is a multi-voice text-to-speech system, which describes a way to apply recent advances in the image generative domain to speech synthesis. It would…
-
Hello,
Thank you for your excellent work on the lip2speech-unit project. I am currently trying to perform inference using the instructions provided. However, I encountered a problem related to the …
-
Hi. I tested the model with various kinds of wave files as source. I notice that at inference time, the model performs well with clean source files, but for those not so clean audio files (e.g. 24khz …
-
Hello. I downloaded the pretrained modal `ljspeech v3.1` and when I try to run `python gen_forward.py --alpha 1 --checkpoint pretrained-forward_step90k.pt --input_text 'this is whatever you want it to…
-
Training test with dataset tetyana without correct stress in words.
[ukr_test_40_28.zip](https://github.com/user-attachments/files/16836727/ukr_test_40_28.zip)
-
I tried on colab. it works fine with default vocoder.
It returned the error when using bigvgan.
-
**🚀 Feature Description**
Hey, we saw that there is no training code for fine-tuning all parts of XTTS V2. We would like to contribute if it adds value.
The aim can be to make it work very reliab…