mit-han-lab / offsite-tuning

Offsite-Tuning: Transfer Learning without Full Model
https://arxiv.org/abs/2302.04870
MIT License
367 stars 38 forks source link

How to run distributed evaluation for big models used in this paper? #4

Open sanyalsunny111 opened 1 year ago

sanyalsunny111 commented 1 year ago

It seems all the eval for LLMs are done using 1 GPUs can you suggest ways to run distributed eval?