NolanoOrg / sparse_quant_llms

SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
41 stars 3 forks source link