ikawrakow / ik_llama.cpp

llama.cpp fork with additional SOTA quants and improved performance
MIT License

iq2_k: slightly better bpw - accuracy compromise #20

Closed ikawrakow closed 2 months ago

ikawrakow commented 2 months ago

For LLaMA-3.1 models: