Open luionTW opened 2 weeks ago
Can the team support the BitNet.cpp, that is another one pure 1 bit model. https://arxiv.org/pdf/2310.11453
The new quantization can help the edge computing improvement .
No response
Here's a fork of llama.cpp used in the Microsoft's BitNet repo. It would be great to upstream the changes.
Can the team support the BitNet.cpp, that is another one pure 1 bit model. https://arxiv.org/pdf/2310.11453
Motivation
The new quantization can help the edge computing improvement .
Possible Implementation
No response