Open galv opened 1 year ago
Version 0.4.0:
Topology | Model | Test-Clean | Test-Other | Dev-Clean | Dev-Other |
---|---|---|---|---|---|
Vanilla | Small | 3.56%, 4464.1221 | 7.12%, 4527.6245 | 3.44%, 3998.9165 | 6.99%, 4528.6743 |
Vanilla | Medium | 3.20%, 3721.2207 | 5.73%, 4082.0947 | 2.80%, 3726.6550 | 5.46%, 4099.2676 |
Vanilla | Large | 2.74%, 2321.9045 | 4.53%, 2189.0354 | 2.49%, 2132.1375 | 4.31%, 2574.2717 |
Compact | Small | 3.17%, 4320.1294 | 6.83%, 4110.2827 | 2.97%, 3721.0525 | 6.68%, 4458.4355 |
Compact | Medium | 2.65%, 3726.7332 | 5.32%, 3909.8123 | 2.27%, 3621.8477 | 5.03%, 3800.0156 |
Compact | Large | 2.21%, 2261.0801 | 4.10%, 2205.2964 | 1.87%, 2196.7666 | 3.90%, 2530.3594 |
Just using this GitHub issue as a markdown scratchpad to create a table of results to show later.
Each table cell is "WER, RTFx": word error rate (lower is better), then inverse real-time factor, i.e. seconds of audio processed per second of wall-clock time (higher is better).
"Small", "Medium", and "Large" refer to Conformer CTC Small, Conformer CTC Medium, and Conformer CTC Large, respectively.
Used the ARPA LM from https://www.openslr.org/resources/11/3-gram.pruned.3e-7.arpa.gz
All models were run in half precision.
Relevant hyperparameters:
All results were obtained with an A100-80GB GPU on a 16-core CPU server.
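
For anyone reading the table, here is a minimal sketch of how the two numbers in each cell are conventionally defined. It is not the benchmarking code used for these results; the function names `word_error_rate` and `rtfx` are hypothetical, and it assumes the usual definitions (word-level edit distance divided by reference length, and audio duration divided by processing time).

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the first i-1 reference
    # words and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def rtfx(audio_seconds: float, processing_seconds: float) -> float:
    """Inverse real-time factor: seconds of audio decoded per second of compute."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


if __name__ == "__main__":
    wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(f"WER:  {wer:.2%}")
    print(f"RTFx: {rtfx(3600.0, 1.5):.1f}")
```

Corpus-level WER is normally computed by summing edit distances and reference word counts over all utterances before dividing, rather than by averaging per-utterance values.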