k2-fsa / icefall

https://k2-fsa.github.io/icefall/
Apache License 2.0
792 stars 267 forks source link

Seeking advice on parameter configuration and settings for large-scale ASR models #1596

Closed brainbpe closed 4 weeks ago

brainbpe commented 1 month ago

We are planning using icefall and zipformer to train large-scale ASR models and will likely conduct multiple experiments with three different sizes: 600M, 1B (similar to Whisper Large), and 5B. We are seeking advice on parameter configuration and settings for these models. Our goal is to achieve the best possible performance. We have observed that the largest publicly available model has approximately 140M parameters in icefall ,and Our training data consists of several million hours of audio.

Iany good suggestions for this? @danpovey @pingfengluo @nshmyrev

danpovey commented 1 month ago

That's nice! For each doubling of parameter size, I would probably: