issues
search
common-voice
/
cv-dataset
Metadata and versioning details for the Common Voice dataset
https://commonvoice.mozilla.org/datasets
Mozilla Public License 2.0
141
stars
15
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Change request: Please order age ranges chronologically
#35
KathyReid
opened
1 month ago
1
A small request for column & field naming
#34
HarikalarKutusu
opened
3 months ago
0
How many peoples in all dataset?
#33
wntg
closed
4 months ago
4
Fix json incompatible commas in old single word dataset metadata
#32
HarikalarKutusu
opened
6 months ago
0
Some mp3 files in cv corpus 4 are empty
#31
yundaqwe
opened
7 months ago
6
Minor Bug in Text Corpus calculations
#30
HarikalarKutusu
opened
7 months ago
1
有没有中文native-speaker能帮忙解释下各个*.tsv的意思
#29
Liujingxiu23
closed
10 months ago
0
FEATURE REQUEST: Please add `duration` as a metadata item that is included in the `*.tsv` files with a release
#28
KathyReid
closed
8 months ago
2
Feature request: Datasets with only validated recordings
#27
soliviantar
opened
1 year ago
0
FEATURE REQUEST: Make the `.tsv` files that are part of a downloaded dataset available separately
#26
KathyReid
opened
1 year ago
1
Error: Version 15 summary data does not contain nested objects for splits (age, gender) and buckets (validation)
#25
KathyReid
closed
1 year ago
3
Oi 2763 data release v 15 full delta
#24
moz-dfeller
closed
1 year ago
1
need label about sample clean or noisy
#23
JohnHerry
closed
1 year ago
3
feat: add cv-corpus-13.0-delta-2023-03-09.json
#22
moz-dfeller
closed
1 year ago
0
Wrong checksums for Common Voice Corpus 13.0
#21
paniedziela
opened
1 year ago
1
German 12.0 Segment missing train dev test TSV files
#20
LozramA
opened
1 year ago
3
Feature Request: More digits for percentage values.
#19
HarikalarKutusu
opened
1 year ago
0
Add cv12 data
#18
mozgzh
closed
1 year ago
0
Wrong duration value in ar v10.0
#17
HarikalarKutusu
closed
1 year ago
3
Add CV11 Statistics
#16
mozgzh
closed
2 years ago
0
Bug: Discrepancy for locale "eo" in v10.0 dataset
#15
HarikalarKutusu
closed
1 year ago
0
Feature request: CSV
#14
bulvara
opened
2 years ago
3
Add CV10 data
#13
mozgzh
closed
2 years ago
0
Bug: accents splits are not shown in dataset JSON summary after release 7
#12
KathyReid
closed
2 years ago
3
CV 9 Dataset metadata
#11
mozgzh
closed
2 years ago
0
Feature request: list sampling rates in dataset, download dataset given sampling rate
#10
rafaelvalle
opened
2 years ago
0
Dataset 8
#9
mozgzh
closed
2 years ago
0
Add CV 8.0 metadata
#8
JRMeyer
closed
2 years ago
0
Feature request: Summary data of each language including rows with metadata, gender, age, accent distribution
#7
KathyReid
opened
2 years ago
3
Possibility to release tar file with just the additional data
#6
siddalmia
opened
3 years ago
2
Script to download all the datasets
#5
harrygcoppock
closed
3 years ago
1
.tsv files not found
#4
zuther77
closed
3 years ago
1
Add stats and changelog for 6.x releases
#3
phirework
closed
3 years ago
0
Download format is .tar instead of .tar.gz
#2
HaritYadav
opened
3 years ago
4
Fix stereo files to mono
#1
Mte90
closed
3 years ago
5