Open astarostap opened 1 year ago
I am encountering the same issue using 1 A100 GPU 40 GiB for fine tuning.
solved it by using bitsandbytes=0.37.2 version
Facing same issue with 4 A40 48 GB
What's the recommended number and type of GPUs for finetuning this ?
any update?
Looks like GPU usage almost doubles during saving (save_pretrained - get_peft_model_state_dict function). Is there a way to avoid this?
stack trace: