OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
MIT License
672 stars 52 forks source link

The llama-2-7b model can't quant in this code #93

Open Hzqskywkr opened 1 week ago

Hzqskywkr commented 1 week ago

Dear authors, we have tried llama-2-7b omniquant in W4A4 following the given script. It is not stable nor successful in training as follows, the loss in the first epoch is NAN. The training scheme is the same. Are there any solutions to fix it? We set the epoch=1, CUDA_VISIBLE_DEVICES=0 python main.py \ --model hzq/llama/llama-2-7b-hf --eval_ppl \ --epochs 1 --output_dir ./log/Llama-2-7b-chat-w4a4 \ --wbits 4 --abits 4 --lwc --let \ --let_lr 1e-3 --alpha 0.75 Screenshot from 2024-09-06 19-14-35