Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Pseudo-labelling librispeech_asr (train.360): KeyError `train-360` when not streaming. #96
Open
guynich opened 3 months ago
When not streaming, this line results in a KeyError for `train-360`. The pseudo-labelled dataset was not saved after hours of compute. I think this KeyError might be caused by this code line that changes the split name.
My bash script uses the `librispeech_asr` split name `train.360`, as defined here.
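To illustrate the suspected mismatch, here is a minimal sketch that does not use the repository's code. A plain dict stands in for the non-streaming dataset object, and `resolve_split` is a hypothetical helper, not something from the script or from the `datasets` library. It assumes the rename swaps `.` for `-` in the split name, which is what the `train-360` in the error suggests.

```python
def resolve_split(dataset_dict, split_name):
    """Return the key in dataset_dict that matches split_name,
    tolerating '.' versus '-' in the name (hypothetical helper)."""
    if split_name in dataset_dict:
        return split_name
    # Try both spellings of the separator.
    candidates = {split_name.replace("-", "."), split_name.replace(".", "-")}
    for candidate in candidates:
        if candidate in dataset_dict:
            return candidate
    raise KeyError(
        f"Split {split_name!r} not found; available: {sorted(dataset_dict)}"
    )


# Stand-in for a non-streaming dataset keyed by the original split name.
raw_datasets = {"train.360": ["sample_0", "sample_1"]}

# Assumed rename: '.' replaced by '-' before the lookup.
renamed = "train.360".replace(".", "-")

try:
    raw_datasets[renamed]
except KeyError as err:
    print("KeyError:", err)

# A tolerant lookup finds the original key again.
key = resolve_split(raw_datasets, renamed)
print(key, len(raw_datasets[key]))
```

A lookup like this, or simply indexing with the original split name, would avoid losing the labelled data at the final step. Whether that is the right fix for the script depends on why the split is renamed in the first place.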