k2-fsa / snowfall

Moved to https://github.com/k2-fsa/icefall
Apache License 2.0
143 stars 42 forks source link

WIP: BPE Training with k2 ctc loss #234

Closed pkufool closed 2 years ago

pkufool commented 2 years ago

FYI @glynpu

My current result are as follows (without transformer decoder, say att_rate = 0.0):

decoding model WER % on test-clean WER % on test-other
avg epoch 18 - 28 11.94 26.78
avg epoch 9 - 13 31.65 58.88
pkufool commented 2 years ago

There was a bpe_run.sh in master branch, posted by Liyong in https://github.com/k2-fsa/snowfall/pull/217, it only did decoding things, so I rename it to bpe_decode.sh. All the training and decoding pipeline are in bpe_run.sh script in this PR.

pkufool commented 2 years ago

We will discuss BPE training with k2 ctc loss in this PR https://github.com/k2-fsa/snowfall/pull/219