-
Hi,
I am trying to transcribe live stereo audio to mono audio and transcribe them, is there any recommended methods to implement this, I have tried converting stereo to mono and my result is very i…
-
-
**🚀 Feature Description**
This is a request for improving the documentation. On the readme, you have a
- List the available speakers and choose a among them:
`$ tts --model_name "//" --list_…
surak updated
1 month ago
-
### What happened?
The transcript of a 1h multi speaker file generates the following output:
00:00 --> 01:20
Speaker 1:
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!…
-
## Dataset Format
The pre-processing script expects data to be a directory with:
* `metadata.csv` - CSV file with text, audio filenames, and speaker names
* `wav/` - directory with audio files
The …
-
### Description
We have to train the current mms model on multi speakers.
### Completion Criteria
A mms tts model with multi-speakers option using speaker id.
-
Hello, thank you for your open source! Is the if branch in the "assert_required_models_available" function in matha/cli.py written in reverse? When I specify checkpoint_path, it keeps downloading the …
-
### Describe the bug
not supporting lon texts mor than 1000 tokens
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Reproduction
ask something large
### Scr…
-
I was wondering if anyone was successful with finetuning a styletts2 base model on a different language eg. French, Spanish, etc... and achieved good results, if so someone kind enough to share some o…
-
Hello,
I just trained on two speakers at the same time.
The filelist looks like this:
```
/home/ubuntu/RVC-beta-v2-0528/logs/merged/0_gt_wavs/0_4_48.wav|/home/ubuntu/RVC-beta-v2-0528/logs/merg…
Rolun updated
5 months ago