-
Training speech recognition and text-to-speech models from scratch in Azerbaijani will require a comprehensive dataset of high-quality audio and corresponding text transcriptions. Here are the steps t…
-
2 freely available possible datasets have already been identified, more are welcome:
1. Mozilla Common Voice https://voice.mozilla.org/en
CC-0 license
2. Openslr resources http://openslr.o…
-
Hi!
First of all, thank you for your code and your models! Really really useful!
I've used the finetuning script to try to finetune it to spanish with common voice dataset. However, after infer…
-
**Is your feature request related to a problem? Please describe.**
I'm mainly use RVC to voice different characters, if most of time it works well enough, in some cases like screams, breath, laughs o…
-
Hi @nkrao220,
My network (moz_cnn_2.py) is not generalizing at all. The validation accuracy goes up by 2-3% and is stuck at 50%. Please, could you give me some pointers on how you trained yours and…
-
## Describe the bug
The function load_from_disk fails when using a remote filesystem because of a wrong temporary path generation in the load_from_disk method of arrow_dataset.py:
```python
if …
-
Hey there,
I'm trying to fine-tune the TTS model for the German language, but I'm fairly new to this field. I've tried various approaches and datasets, such as the German part of the M-AI Labs data…
-
I wish for the merging of both the gom and knn locales. However, this might not be possible in its entirety.
But, as a start, the sentences collected by both locales (romi and devanagari) could be …
-
# Welcome to the Common Voice Community !
> Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labeled voice data that is representative of la…
-
I originally thought that this issue is only specific to the zh-hk locale, but later realize that this is quite widespread and seriously harming the data quality of many languages. So currently, some …