Open x4080 opened 1 year ago
Yes, just modify the utils.py
following the example from HF blog:
model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m", load_in_4bit=True, device_map="auto")
We'll update the code after testing.
cool, thanks @JetRunner
from https://huggingface.co/blog/4bit-transformers-bitsandbytes ?