jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models
MIT License
1.25k stars 110 forks

Hardware equipment and training time? #37

Open zhoumengbo opened 7 months ago

zhoumengbo commented 7 months ago

I am very curious about the hardware you used for training and how long the training took. Do you have a detailed description you could share? If so, I would be extremely grateful.