hiyouga / LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs
Apache License 2.0

Does NPU support GPTQ quantized export? #4594

Closed murray-z closed 5 days ago

murray-z commented 5 days ago

Reminder

System Info

Reproduction

### model
model_name_or_path: Llama3-8B-Chinese-Chat
template: llama3

### export
export_dir: Llama3-8B-Chinese-Chat-gptq-bit4
export_quantization_bit: 4
export_quantization_dataset: gptq_100.json
export_size: 5
export_device: cpu
export_legacy_format: false

Expected behavior

I ran the export command. It never reported an error, but it also never produced any output.

Others

No response

hiyouga commented 5 days ago

Not supported.
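
Because the export in this report exited silently, a pre-flight check could surface the limitation before the command runs. The following is only a minimal sketch: `check_gptq_export` and its `accelerator` argument are hypothetical names and not part of LLaMA-Factory, and the config is a plain dict that mirrors the keys from the reproduction above.

```python
from typing import Optional


def check_gptq_export(config: dict, accelerator: str) -> Optional[str]:
    """Hypothetical helper: return a warning when a quantized export
    is requested on an NPU host, otherwise return None."""
    bit = config.get("export_quantization_bit")
    if bit is not None and accelerator.lower() == "npu":
        return f"GPTQ {bit}-bit quantized export is not supported on NPU."
    return None


# Keys copied from the reproduction config in this issue.
config = {
    "model_name_or_path": "Llama3-8B-Chinese-Chat",
    "template": "llama3",
    "export_dir": "Llama3-8B-Chinese-Chat-gptq-bit4",
    "export_quantization_bit": 4,
    "export_quantization_dataset": "gptq_100.json",
    "export_size": 5,
    "export_device": "cpu",
    "export_legacy_format": False,
}

# The accelerator type is passed in by the caller (an assumption of this sketch).
message = check_gptq_export(config, accelerator="npu")
if message is not None:
    print(message)
```

The accelerator is an explicit argument because this sketch does not assume any particular device-detection API. A config without `export_quantization_bit` passes the check unchanged.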