Open jwkirchenbauer opened 9 months ago
Awesome work 🙂
Is there any plan to release the training code for cosmo-1b? Or at least details about what existing repos and framework tools were used?
Hi, we used an internal wrapper around the nanotron framework. This is the training config (you will need to adapt it to work with nanotron).