jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models
MIT License
1.32k stars 115 forks source link

Training system configuration #25

Closed shossain closed 12 months ago

shossain commented 1 year ago

Could you share the number of GPUs, VRAM size used for finetuning?

Thanks!

cebtenzzre commented 1 year ago

See https://github.com/jquesnelle/yarn/issues/9#issuecomment-1702150089