Open jpyo0803 opened 4 months ago
Hi,
I am wondering how to quantize Llama3-8B with SmoothQuant.
Which dataset did you use to generate the activation scales?
Alternatively, do you plan to upload the act_scales and model weights (to Hugging Face) and a quantized version of the source code (to GitHub) for Llama3?
Thanks in advance!