-
**Describe the bug**
Warning encountered while training on AISHELL2+librispeech+cstal:
```
~/miniconda/envs/wenet/lib/python3.8/site-packages/torch/distributed/algorithms/join.py:258: UserWarning: Detected uneven input skew…
-
Bit rate=8k
Downstream tasks (only the 16 kHz model is used)
```
Stage 1: Run speech emotion recognition.
Acc: 75.21%
Stage 2: Run speaker related evaluation.
Parsing the resyn_trial.txt for resyn w…
-
**Debugging checklist**
[x] Have you updated to latest MFA version? Yes
[x] Have you tried rerunning the command with the `--clean` flag? Yes
**Describe the issue**
I want to generate TextGrid…
-
## Description
Can documentation be created on how to convert text to speech using the new audio extension? Preferably an example.
This is going to help users who are trying to implement such a …
-
100K cache entries (1%) have the `ConfigNamesError`. It would be better to show the underlying error, and help the user debug their data files.
-
Thanks for sharing this great repo!
I'm wondering what the typical range of `num_train_steps` is for a SoundStream model and others.
I tested with 10000 and saw the loss went down somewhat smoot…
-
The piper-phonemizer setup is a bit confusing at the moment, as it is both included with some significant code and imported as a library at runtime. The two phonemizers, text and espeak, are both tightly …
-
### Describe the bug
Here's a detailed description of the issue you're encountering along with the relevant error message, translated into English:
---
### Issue Description
I am following…
-
Hello all,
I am a beginner in this domain and am doing my first experiment.
After training the model, I got the .h5 model file. But while testing, Librispeech_train_4_1030.subword is not reading the vo…
-
As discussed during our @huggingface/datasets meeting, we are planning to move some "canonical" dataset scripts under their corresponding organization namespace (if this does not exist).
On the con…