foldl / chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU)
MIT License

F16 quantization does not work #28

Closed. foldl closed this issue 2 months ago.

foldl commented 2 months ago

Closed. This is a mistake.