-
## ❓ Questions and Help
### What is your question?
Hello, I'm having a problem to make a well-converged K-means clustering model for S2U.
I am trying to train the K-means clustering model with v…
-
What can this application actually do more than the browser-based version of GPT.
Where there is an advantage ?
-
Hello Puyuan,
First, thank you for creating and sharing this amazing work.
I'm starting to read about neural speech synthesis related recently. So the answers of these questions may be obvious:…
-
# Phone/Phoneme segment counting
This task is to count the number of phoneme segments in a given speech sample. This task is essential for evaluating the ability of models in the benchmark to accurat…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
Hello, what should I do if I want to use your model results as the speaker ID for the speech synthesis project? Can you publish your training model
-
model: fastspeech2
i use speech from wav file and its text, duration, pitch, and energy to synthesis a new audio with another voice, i find that when i use source speech`s pitch, the voice sounds lik…
-
# Text-to-Speech Synthesis
Text-to-Speech is a speech generation task that converts written language into its spoken form.
## Task Objective
Text-to-Speech Synthesis (TTS) is an essential ta…
-
### Describe the bug
When running text-to-speech on an english model, when tts tries to write the .wav file, it runs out of memory. I'm running on cpu only. My machine has ~14GB available RAM
I …
-
My command to run
````
talk-llama.exe -mw ggml-large-v3.bin -ml marcoroni-7b-v3.Q8_0.gguf -s speak.bat
````
Added speak.bat and speak.ps1 to the root folder
speak.bat
````
@powershell -Exe…