byroneverson / llm.cpp

Fork of llama.cpp, extended for GPT-NeoX, RWKV-v4, and Falcon models
MIT License
28 stars, 2 forks