Open hanrui1sensetime opened 9 months ago
I have found the script in SmoothQuant repo. So I will close the issue.
I have found the script in SmoothQuant repo. So I will close the issue.
Could you please share how the problem was solved?
Found it quite easily. For the further interest, one could find the code for scales generation here. All the needed info is described in README
We want to try QUIK on our self-implemented llama-like model weights. We found that may be there is no script about how to generate
act_scales
.pt files. So we use calibration data items to quant activation and save it first? How much items should we use, and need act_zeros too?I'm looking forward to the reply soon. Thanks.