jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models
MIT License
1.25k stars 110 forks source link

Can we run the replication of the results,8 * 80 A100 #55

Open zhanglv0209 opened 3 months ago

zhanglv0209 commented 3 months ago

Hello, can we run the project on 8 80G A100 cards? If not, could you please provide a reference configuration

tzadouri commented 1 week ago

^^^^