Open karim1104 opened 1 year ago
I'm facing the above error in both stage 1 and stage 2 when using BLOOMZ 3B and 560M. I tried adding "model.to(device)" and "model.to('cuda')" to main.py but neither worked. The error only appears when I switch from Llama to BLOOMZ.
hi, whether you have figured out the reason?
I'm facing the above error in both stage 1 and stage 2 when using BLOOMZ 3B and 560M. I tried adding "model.to(device)" and "model.to('cuda')" to main.py but neither worked. The error only appears when I switch from Llama to BLOOMZ.