-
I originally thought that this issue is only specific to the zh-hk locale, but later realize that this is quite widespread and seriously harming the data quality of many languages. So currently, some …
-
### User story
* As a catalog user, I'd like to limit my results to Tagalog language results, when Tagalog is not one of the most common languages in my search results.
* As a catalog user, I'd li…
-
# Welcome to the Common Voice Community !
> Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labeled voice data that is representative of la…
-
**Describe the bug**
### Cantonese users are brought to other "zh" languages automatically.
The issue I am reporting is that Common Voice automatically takes me to the Chinese Hong Kong language …
-
## General background
* Thanks to the advancement of HTML5 and related Web technologies, Web applications with speech capability is getting more and more popular for ease of user-interaction and ri…
-
Hey Common-Voice team!
Thanks a lot for releasing the common voice 7 dataset - it's great to see so many new languages!
At Hugging Face, we have worked a lot with the common voice 6.1 dataset an…
-
# Welcome to the Common Voice Community !
> Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labelled voice data that is representative of l…
-
# Welcome to the Common Voice Community !
> Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labelled voice data that is representative of l…
-
**[ UUID ]** 97a3b0ee-aa3b-4c77-993f-cd6af659c5e4
**[ Session Name ]** Speaking in Many Tongues: Mozilla’s Common Voice Project
**[ Primary Space ]** Digital Inclusion
**[ Submitter's Name ]** Dely…
-
https://github.com/Common-Voice/common-voice-wiki-scraper/blob/a23abced7713c2260f78fc77252727fe719d6eca/src/checker.rs#L37
Here you split words just around white spaces. You should use word boundar…