issues
search
ikawrakow
/
ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
MIT License
89
stars
6
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Merge mainline llama.cpp
#3
ikawrakow
closed
3 months ago
1
Offload Bitnet token embeddings to the GPU - the right way
#2
ikawrakow
closed
3 months ago
0
Offload Bitnet token embeddings to the GPU
#1
ikawrakow
closed
3 months ago
0
Previous