Open nikhilanayak opened 2 years ago
Would it be possible to finetune 20B using this repo and the TPU V3-8's? If so, how many TPUs would be needed and how would I have to change the code to make it work with more than 1 TPU?
Would it be possible to finetune 20B using this repo and the TPU V3-8's? If so, how many TPUs would be needed and how would I have to change the code to make it work with more than 1 TPU?