issues
ilur98/DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
MIT License
12 stars
1 fork
Which kernel is used for the A16W4 computation in Figure 4 of the paper?
#3
Tmn07
opened
4 months ago
0
How to properly run W8A8?
#2
casper-hansen
opened
8 months ago
1
When will the kernel implementation be released?
#1
helloyongyang
closed
8 months ago
4