microsoft / T-MAC

Low-bit LLM inference on CPU with lookup table
MIT License
420 stars 32 forks source link

E2E integration into llama.cpp #7

Closed kaleid-liner closed 4 months ago