IST-DASLab / gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
https://arxiv.org/abs/2210.17323
Apache License 2.0
1.92k stars 153 forks source link

act-order on inference #47

Open frankxyy opened 11 months ago

frankxyy commented 11 months ago

Hi, does act-order usage require inference change? Thank you