coqui-ai / STT-models

Open models for Coqui STT
https://coqui.ai
117 stars 35 forks source link

yesno model: training data not available #6

Closed jeremiahrose closed 3 years ago

jeremiahrose commented 3 years ago

The documentation (here) for the English yesno model says that the model was trained on the Common Voice Target Segments Corpus.

However this corpus does not seem to be available online. The documentation should provide a link to the location of this dataset.

reuben commented 3 years ago

Search for "target segment": https://commonvoice.mozilla.org/en/datasets

reuben commented 3 years ago

And I guess generally the full metadata is available on this repo: https://github.com/common-voice/cv-dataset/

jeremiahrose commented 3 years ago

Ah, great. The Single Word Target download is really not obvious on that page. Thankyou