mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
https://arxiv.org/abs/2211.10438
MIT License
1.1k stars 127 forks source link

ask for a function in linear.py for smoothquant in llama @Anizpz #71

Open msz12345 opened 6 months ago

msz12345 commented 6 months ago

in your script (named llama.py): it reads "from torch_int.nn.linear import W8A8BFP32OFP32LinearWithSFactor, W8A8BFP32OFP32Linear" But there is no such class or function named "W8A8BFP32OFP32LinearWithSFactor" in linear.py in torch-int/nn/linear.py at all!! ![Uploading image.png…]()