Closed by fujimotos 1 year ago
Progress

Created augmented data and conducted ASR experiments on the 300h ReazonSpeech Medium set using MUSAN.
--fg-interval 1 --fg-snrs "15:10:5:0"
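
For readers unfamiliar with SNR-based augmentation, here is a minimal sketch of mixing a noise clip into an utterance at a target signal-to-noise ratio. It assumes the colon-separated values in `--fg-snrs` are SNRs in dB. The helper names `parse_snrs` and `mix_at_snr` are hypothetical and are not taken from the tool used in this issue, whose actual mixing logic may differ.

```python
import math


def parse_snrs(spec):
    """Parse a colon-separated SNR list such as "15:10:5:0" (assumed to be dB)."""
    return [float(value) for value in spec.split(":")]


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech`, scaled so the speech-to-noise power ratio is `snr_db`.

    Both inputs are plain lists of float samples. This is an illustrative
    helper, not the implementation used for the experiments above.
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")

    # Repeat the noise if it is shorter than the utterance, then trim it.
    if len(noise) < len(speech):
        noise = list(noise) * math.ceil(len(speech) / len(noise))
    noise = noise[:len(speech)]

    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if speech_power == 0 or noise_power == 0:
        # Nothing meaningful to scale against; return the speech unchanged.
        return list(speech)

    # SNR(dB) = 10 * log10(speech_power / (scale^2 * noise_power))
    scale = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + scale * n for s, n in zip(speech, noise)]


snrs = parse_snrs("15:10:5:0")  # [15.0, 10.0, 5.0, 0.0]
speech = [1.0, -1.0] * 100
noise = [0.5, 0.5, -0.5, -0.5] * 50
mixed = mix_at_snr(speech, noise, snrs[1])  # mix at 10 dB
```

A lower SNR means louder noise relative to speech, so under this reading 0 dB is the harshest condition in the list.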
(These settings are fairly strong, with information loss in conversion.)

CER on models trained from scratch | medium | medium+augmented
---|---|---
JSUT 5000 | 19.39% | 17.78%
CV v8.0 test | 22.85% | 20.42%

CER (%) on finetuned model | Whisper large v2 | RS 15k | RS 15k augmented finetuned
---|---|---|---
JSUT 5000 | 8.17 | 8.23 | 8.33
CV v8.0 test | 9.70 | 9.93 | 9.56
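
For reference, the CER figures in the tables can be read as character-level edit distance divided by the number of reference characters. The sketch below shows that calculation under the assumption that no text normalization is applied. The evaluation in this issue may have normalized punctuation or script differently, and `edit_distance` and `cer` are illustrative names only.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two strings, counted in characters."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference character
                curr[j - 1] + 1,     # insertion of a hypothesis character
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def cer(references, hypotheses):
    """Corpus-level character error rate in percent."""
    total_errors = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    return 100.0 * total_errors / total_chars


# One substitution (晴 -> 雨) and one deletion (れ) over 5 reference characters.
print(cer(["今日は晴れ"], ["今日は雨"]))  # 40.0
```

Errors are summed over the whole test set before dividing, so long utterances weigh more than short ones. Averaging per-utterance CER instead would give a different number.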
Improving robustness to noise
Ticket goal
Reference links