issues
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
Apache License 2.0
649 stars
62 forks
Does the dataset contain TV-series and movies?
#139
Ziyi6
opened
1 week ago
0
Preparation of dataset from scratch in other language
#138
Tortoise17
opened
2 months ago
0
What's the array inside GigaSpeech?
#137
rrscholarship
opened
6 months ago
0
Update the bibtex entry to avoid the "Too many commas" error in LaTeX
#136
JinZr
opened
9 months ago
0
When will the speaker labels be released?
#135
XiXiRuPan
opened
10 months ago
0
Why don't the GigaSpeech dev & test results match the report?
#134
dahu1
closed
1 year ago
0
New SOTA: Zipformer in k2
#133
yfyeung
closed
1 year ago
0
Clean the original dataset that was collected from different sources: YouTube, Podcast, and Audiobook.
#132
kerolos
opened
1 year ago
1
Can you provide "text_raw" information?
#131
lifeiteng
opened
1 year ago
2
[Mismatch] <SIL>, <MUSIC> etc. only appear in the validation and test split, never in train
#130
jasonppy
closed
1 year ago
1
Duplicates, and some YouTube video links are wrong. Observations [not issue]
#129
npovey
opened
1 year ago
0
Number of words in training text
#128
huangruizhe
closed
1 year ago
2
deprecate utils/save_segments_in_flac.py due to high-freq aliasing bug
#127
dophist
closed
1 year ago
0
gigaspeech.json does not contain audio/podcast/P0081-P0084
#126
wwfcnu
opened
1 year ago
15
"begin_time" is larger than audio length
#125
Wonder1905
closed
2 years ago
0
About gigaspeech glm file
#124
CuiMingyu
opened
2 years ago
2
Fix Athena data prep bug
#123
dophist
closed
1 year ago
1
Add utt2spk_to_spk2utt.pl to repo and fix path
#122
npovey
closed
2 years ago
0
Can't locate utt2spk_to_spk2utt.pl file
#121
npovey
closed
2 years ago
2
Gigaspeech egs in Kaldi stopped
#120
leohuang2013
closed
2 years ago
6
Inconsistency in local/gigaspeech_data_prep.sh
#119
leohuang2013
closed
2 years ago
3
update Readme with HuggingFace links
#118
dophist
closed
2 years ago
0
GigaSpeech on HuggingFace
#117
dophist
opened
2 years ago
1
bug fixed
#116
bxcxa
closed
2 years ago
0
How can I continue to download from the disconnection point?
#115
guo453585719
opened
2 years ago
2
Is XL subset the 33000hr unlabeled data?
#114
mct10
opened
2 years ago
1
Add icefall RNN-T results
#113
wgb14
closed
2 years ago
6
Unable to download `M` subset
#112
sanchit-gandhi
closed
2 years ago
6
Model size and training time on the leaderboard
#111
csukuangfj
closed
2 years ago
1
Still getting "utils/internal/download_gigaspeech_with_pyspeechcolab.sh: This recipe needs the package speechcolab installed." message
#110
makwadajp
closed
2 years ago
2
Add results from Icefall
#109
wgb14
closed
2 years ago
7
download problem
#108
zcswdt
closed
2 years ago
1
minor fix for gigaspeech_data_prep.sh
#107
shanguanma
closed
2 years ago
0
Add neurst result to leaderboard
#106
mct10
closed
2 years ago
1
Revert "Add NeurST results to leaderboard"
#105
chenguoguo
closed
2 years ago
0
Add NeurST results to leaderboard
#104
mct10
closed
2 years ago
2
Punctuation removal in Athena/prepare_data.py fuses words in transcripts added to .csv files
#103
IanMGriff
closed
2 years ago
2
Failed to convert opus to wav
#102
mct10
closed
2 years ago
2
What is the source of GigaSpeech Podcast and Audiobook?
#101
xiaobobo-bilibili
closed
2 years ago
2
Mismatched sample rate in Opus files
#100
aheba
opened
2 years ago
1
add handy tool to extract gigaspeech subset segments, e.g. dev/test sets
#99
dophist
closed
2 years ago
0
Incorrect character in GigaSpeech.json
#98
dscripka
closed
2 years ago
3
utt2spk_to_spk2utt.pl: Command not found when running toolkits/kaldi/gigaspeech_data_prep.sh
#97
ddlBoJack
closed
3 years ago
1
Verify password
#96
jimbozhang
closed
3 years ago
0
Connection Error?
#95
gray4what
closed
3 years ago
1
download dev and test subsets
#94
chaisz19
closed
3 years ago
0
zlib.error: Error -3 while decompressing data: incorrect header check
#93
ddlBoJack
closed
3 years ago
6
Error when downloading dataset
#92
CuiMingyu
closed
3 years ago
1
Can you add an option to skip checking files that have already been downloaded?
#91
al3chen
closed
2 years ago
2
Add utt2spk_to_spk2utt
#90
chaisz19
closed
2 years ago
2