issues
search
coqui-ai
/
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
MIT License
1.28k
stars
140
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
adding inaGVAD corpus
#221
DavidDoukhan
opened
5 months ago
0
Persian tts dataset
#220
karim23657
opened
1 year ago
0
Added german Thorsten-Voice datasets.
#219
thorstenMueller
closed
2 years ago
0
podcast fillers
#218
JRMeyer
opened
2 years ago
0
Facestar
#217
JRMeyer
opened
2 years ago
0
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
#216
JRMeyer
opened
2 years ago
0
XTREME-S dataset
#215
jhdeov
opened
2 years ago
0
Who <!--
#214
Jerryagonoy25
closed
2 years ago
0
Santa Barbara Speech Corpus
#213
JRMeyer
opened
2 years ago
0
Kokoro Japanese TTS single speaker
#212
JRMeyer
opened
2 years ago
0
male LJSpeech italian
#211
JRMeyer
opened
2 years ago
0
CrowdSpeech
#210
JRMeyer
opened
2 years ago
0
ShEMO: a large-scale validated database for Persian speech emotion detection
#209
JRMeyer
opened
2 years ago
0
KsponSpeech (Korean conversations)
#208
JRMeyer
opened
2 years ago
0
JTubeSpeech (Japanese Youtube)
#207
JRMeyer
opened
2 years ago
0
EmoV-DB (emothional synthesis)
#206
JRMeyer
opened
2 years ago
0
finnish parlament
#205
JRMeyer
opened
2 years ago
0
databases from CMU speech group
#204
JRMeyer
opened
2 years ago
0
Sadilar corpora
#203
JRMeyer
opened
2 years ago
0
all podcasts dataset
#202
JRMeyer
opened
3 years ago
0
Arabic corpus
#201
JRMeyer
opened
3 years ago
0
Quran recitation (kaggle)
#200
JRMeyer
opened
3 years ago
0
falabrasil portuguese
#199
JRMeyer
opened
3 years ago
0
EasyComDataset (cocktail party effect)
#198
JRMeyer
opened
3 years ago
0
spoken word QA dataset
#197
JRMeyer
opened
3 years ago
0
Agriculture keywords (english + luganda)
#196
JRMeyer
opened
3 years ago
0
key words for african languages
#195
JRMeyer
opened
3 years ago
0
brazilian portuguese emotion recognition
#194
JRMeyer
opened
3 years ago
0
Voxlingua 107 (6k hours)
#193
JRMeyer
opened
3 years ago
0
WeNetSpeech (10k mandarin)
#192
JRMeyer
opened
3 years ago
0
Do some one train with Japanese?
#191
kju196
closed
3 years ago
0
kreyòl ayisyen :)
#190
JRMeyer
opened
3 years ago
1
Included Odia
#189
psubhashish
closed
2 years ago
2
2k hours japanese TV
#188
JRMeyer
opened
3 years ago
0
data is CC0! 400k hours unlabeled voxpopuli
#187
JRMeyer
opened
3 years ago
0
10k hours japanese youtube
#186
JRMeyer
opened
3 years ago
0
qualcomm hotwords ( hey snapdragon)
#185
JRMeyer
opened
3 years ago
0
kerstin german TTS
#184
JRMeyer
opened
3 years ago
0
Kaggle ukrainian
#183
JRMeyer
opened
3 years ago
0
Added pull_request_template + CODE_OF_CONDUCT
#182
kdavis-coqui
closed
2 years ago
1
SpiCE Corpus === english / cantonese
#181
JRMeyer
opened
3 years ago
0
Diarization Datasets
#180
JRMeyer
opened
3 years ago
0
Mongolian 300 synthetic STT data + others
#179
JRMeyer
opened
3 years ago
3
TwB corpora
#178
JRMeyer
opened
3 years ago
0
media speech: french / arabic / spanish / turkish
#177
JRMeyer
opened
3 years ago
0
english 12 speaker anechoic chamber cc-by 3.0
#176
JRMeyer
opened
3 years ago
0
odia and indic langs
#175
JRMeyer
opened
3 years ago
0
Add several Czech corpora
#174
comodoro
closed
3 years ago
0
Datasets from jace-assistant
#173
JRMeyer
opened
3 years ago
0
african NLP / ASR data
#172
JRMeyer
opened
3 years ago
0
Next