Open epage480 opened 4 days ago
Hi @epage480 ,
Thanks for your validation. Yes, NF4 is currently supported.
The CPU QLoRA example uses the bitsandbytes quantization backend, which already enables NF4 on the Intel 4th Gen Xeon (SPR) platform, as shown here.
In the example example/CPU/QLoRA-FineTuning/qlora_finetuning_cpu.py, a comment says that nf4 is not supported on CPU yet, but when I change the example from int4 to nf4 it still runs without any errors or warnings related to nf4.
Is nf4 now supported? If not, and it is silently falling back to int4, I think it's worth raising an error or printing a warning.
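
To illustrate the kind of warning meant here, below is a minimal sketch of a fallback check. It is not the library's actual code: `SUPPORTED_QTYPES`, `resolve_qtype`, and the contents of the support table are all hypothetical names and values chosen for illustration only.

```python
import warnings

# Hypothetical table of quantization types supported per device.
# These entries are illustrative assumptions, not the library's real list.
SUPPORTED_QTYPES = {
    "cpu": {"int4", "nf4"},
}


def resolve_qtype(requested: str, device: str, fallback: str = "int4") -> str:
    """Return the quantization type to use, warning whenever a fallback happens."""
    supported = SUPPORTED_QTYPES.get(device, set())
    if requested in supported:
        return requested
    # Make the fallback visible instead of switching types silently.
    warnings.warn(
        f"{requested!r} is not supported on {device!r}; "
        f"falling back to {fallback!r}.",
        RuntimeWarning,
        stacklevel=2,
    )
    return fallback
```

With a check like this, requesting a supported type returns it unchanged, while an unsupported one emits a `RuntimeWarning` so the user knows a different type is actually in use.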