-
Though a user would still have to know how to install this with pip, it would be nice to add a GUI. It looks like this could be accomplished by adding just a few lines with https://github.com/chriskiehl/Goo…
-
For the test audio, please see the attachment. The audio is 25 s long and plays normally, but recognition fails with an error:
File "/home/roots/data/SenseVoice/webui.py", line 167, in model_inference
text = model.generate(input=input_wav,
File "/home/roots/anaconda3/envs/coqui-x…
-
### Describe the bug
Redundant, duplicated text is sometimes generated. I use the default model and config (no fine-tuning). The issue does not occur every time, only intermittently (that is why I use a loop i…
-
xTTS v1.1 has no speaker-selection dropdown, and it doesn't work even when the "VC/Clone sample" checkbox is ticked and a .wav file is chosen.
The terminal shows the following error:
```bash
ConfigureVoiceTab…
-
I have CUDA, and my WSL environment recognizes it, but I suppose the Docker container from this project does not yet support CUDA?
(There are no CUDA toolkit commands available, and epub2tts also does n…
-
### Describe the bug
There seems to be a hidden issue in the dataset preparation for fine-tuning TTS on Japanese.
### To Reproduce
1. Clone the repo and install the packages.
``` …
-
Log:
2024-05-21 07:21:35,143 - INFO - PyTorch and xtts_api_server package installed successfully.
2024-05-21 07:21:35,143 - INFO - Installing requirements for pandrator_installer...
2024-05-21 07:2…
-
Hi,
I noticed that the repository uses a really smart approach to account for text length by segmenting the text into smaller chunks and generating audio for each segment separately.
But there's…
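For readers unfamiliar with the idea, the segmentation described above can be sketched roughly as follows. This is only an illustration, not the repository's actual code: the function name `split_into_chunks`, the `max_chars` limit, and the sentence-boundary regex are all assumptions made for this example.

```python
import re


def split_into_chunks(text, max_chars=200):
    """Greedily pack whole sentences into chunks of up to max_chars characters.

    Hypothetical helper for illustration only. A single sentence longer than
    max_chars is kept whole, so a chunk can exceed the limit in that case.
    """
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk would then be synthesized separately and the resulting audio segments concatenated in order.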
-
Apart from the GPT model (which has been implemented), there are 4 other models in TorToiSe that could be fine-tuned:
* the VQVAE, which learns how to encode the training data,
* CLVP, which deter…
-
I'm working on a project that requires efficient handling of multiple concurrent streaming requests. I have some specific requirements and challenges that I'd like advice on:
Scalability: I need …