ruikangliu/FlatQuant
Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization
MIT License
25 stars, 3 forks
issues
Request for Quantized Model Availability (#4)
JoesSattes opened 2 days ago
0
Any plan to integrate it to TRT-LLM or VLLM (#3)
lishicheng1996 opened 3 days ago
0
Benchmark on W4A16 (#2)
RanchiZhao opened 4 days ago
2
Thank you for your interesting work. I want to ask when you will release the code. (#1)
blap closed 2 days ago
1