ikawrakow / ik_llama.cpp

llama.cpp fork with additional SOTA quants and improved performance
MIT License

Faster MoE inference #112

Closed (ikawrakow closed this 1 week ago)

ikawrakow commented 1 week ago

This PR